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ABSTRACT 



This thesis provides the computer software to perform 24 
of the most common nonparametric tests, and a text 
explaining how and why to use that software. 

Nonparametric tests are valuable to experimenters and 
operations analysts for three reasons: ease of explanation 

to non- statisticians , simplicity of coaputation, and appli- 
cability to data sets which cannot be analyzed by parametric 
tests. Reasons for unsuitability of parametric test include 
data of nominal or ordinal level of measurement, lack of 
common variance, or lack of normal distribution of the 
underlying population. 

There are two programs provided: Lochinvar, an interac- 

tive Pascal program, and Crunch, a Fortran program. 
Lochinvar is designed to prompt and screen a user's entry of 
data and options. Crunch performs calculations; it is suit- 
able for use in whole or parr as a subcomponent of other 
programs. 
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I, INTRODUCTIDS 



This thesis is intended to help non-statiticians use 
nonparametric tests to their fullest. The text explains 
when nonparametric tests are preferable to parametric tests, 
how to choose the most appropriate test, and how to use the 
nonparametric software package. 

Parametric tests are often the preferred form of statis- 
tics because of their acceptance in the academic world, the 
availability of computer support, and flexibility in model 
building. Nonparametric tests provide a valuable alterna- 
tive to parametrics, because they may be applied to data 
which, for any one of a number of reasons, cannot be 
analyzed with parametric tests. The simplicity of nonparame- 
tric tests often make the test statistics easier to compute 
and test results easier to explain. Section II gives a 
detailed comparison of parametric versus nonparametric 

test s . 

When confronted with the same hypothesis, various para- 
metric tests give consistent results because each test 
extracts the same information from the data set, 
Nonparametric tests may yield varying results with the same 
data, because different tests extract different information 
from the data set. To overcome this potential problem, the 
most appropriate nonparametric test should be chosen to 
analyze the data set. Section IV gives guidance on 
selecting the best test. 

In order to make using nonparametric tests as easy as 
possible, two programs were written to perform the most 
common 24 nonparametric tests. A general description of how 
to use the programs is contained in section III. Sections V 
through XXVIII contain descriptions of each of the tests. 
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and detailed instructions on their use. The appendices 
contain more detailed information about the programs. 
Appendix A is a tutorial which depicts use of the programs 
to perform a test on a data set. Appendices B through E 
describe the two programs and two files. Appendix G 
contains the interactive Pascal program, Lochinvar, while 
Appendix F contains the Fortran program Crunch. 
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II. COMPA RIS ON OF P ARAMS! RIC VERSOS NONP A RA MET R I C TESTS 

This section contrasts the nonparaaietric statistical 
test with parametric statistics. To do this, there is a 
short summary addressing various levels of measurement, and 
a brief discussion on the relationship between the normal 
distribution and interval data. Finally there is a compar- 
ison of the advantages of the two classes of tests. This 
will help the reader decide whether to use the tests of this 
text . 

A. LEVELS OF MEASOREMENT 

Statisics manipulate raw data to extract whatever infor- 
mation that data contains. Level of measurement indicate 
how much information is contained in the data, and how that 
information may be manipulated. Although formally there are 
four levels of measurement, as defined by Stevens [Ref. 2], 
there are only three which matter when choosing the test: 
nominal, ordinal, and interval. As a consistent rule any 
information contained in data of a lower level of measure- 
ment is contained in data of a higher level; any manipula- 
tion permitted on a lower level is permitted on a higher 
level . 

Nominal data elements contain very little information: 
only membership in a subset of the total data set can be 
determined. Manipulation of the nominal data sets is 
limited to counting the number of elements in each subset. 

Ordinal data elements contain information relative to 
other elements: element A is either less than, greater than, 
or equal to each other element in the sat. However, the 
question "greater than by how much" cannot be answered. The 
data set may be ranked. 
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Interval data elements contain more information, 
only can element A be compared to other elements, but 
inequalities can be quantified; e.g., A>B implies that there 
exists a k such that A=B+k, and that the k can be found by 
k=A-B. The data set can be manipulated with normal arith- 
metic operations, such as addition, division, or raising to 
a power. 
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Nonparametric tests make no assumptions about the under- 
lying distribution of the population from which the data is 
drawn. Additionally, there are nonparametric tests to 
analyze data with nominal level of measurement or higher. 
Because some nonparametric tests reguire only nominal or 
ordinal data, and don’t make assumptions about population 

distributions, the power of those tests may be less than 
that of parametric tests — assuming that the data set justi- 
fied use of parametrics in the first place. 

C. ADVANTAGES AND DISADVANTAGES OF CLASSES OF TESTS 

The advantages and disadvantages of parametric tests are 
the mirror image of those of nonparametric test. Therefore, 
the advantages of each class of tests is given. 

1 . A dvant a g es of Pa ra m etric Tests 

The general advantages of parametric tests are wide 
acceptance of methodology, consistency of results, and capa- 
bility to handle multiple variables in systematic model 
building. 

a. wide Acceptance 

The wide acceptance of parametric test has a 
number of advantages. Statistical software packages contain 
a very complete set of parametric tests, and only a small — - 
although increasing over time — number of nonparametric 

tests. Table of the theoretical distributions needed to 
judge test statistics (the normal, student’s t, chi-sguare 
and F) are extensive and ubiquitous. The terms of the test 
are known to a majority of persons trained in the social or 
physical sciences; whether the teems are fully understood 
may not influence the value of their recognition. 
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b. Consistency of Results 

Different parametric tests give the same conclu- 
sions, given the same null and alternative hypotheses. 
Parametric tests all extract the same information from the 
data, and merely rearrange it for different purposes. The 
parametric distributions are all derived from the normal.' 
With related test statistics and similar test distributions 
for critical values, it is not surprising that different 
techniques give consistent answers to the same question. 

Non para me tries may provide different results to 
the same question on the same data set, because different 
tests extract different types of information from the data 
set. For example, with paired dara elements, the sign test 
considers only which member of the pair is larger; the 
signed rank test needs to know not only which is larger, but 
also the magnitude of that difference. Consistency of 
results is therefor a possible advantage for parametric 
tests. However, this advantage can be minimized by 
selecting the best (most powerful) nonparametric test 

possible, as discussed in a later section. 

c. Versatility in Model Building 

The final, and probably most important, advantage 
of parametrics is their versatility in explaning very 
complex interactions in data. Analysis of variance can test 
differences in treatment effects, multiple blocking effects, 
and interaction among treatments and blockings. Regression 
can be made to do analysis of variance, (by using nothing 
but dichotomous variables) , but is even more flexible, by 
building models using variables with continuous values 
rather than only a fixed number of treatment levels. 



' The t is a normal distribution in which the lack of 
knowledge of the population expected value is corrected for 
by lowered kurtosis; the chi-square distribution of degrees 
or freedom n is the sum of squares of n standard normal 
distributions; the F is the ratio of two chi-squares. 
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The advantages of nonparametric tests are simplicity 
and applicability. 

a. Simplicity 

Nonparametric tests* simplicity allows for rela- 
tively easy calculation of the test statistic and critical 
value for a null , hypothesis. This simplicity also allows 
nonparametric tests to be explained to, and defended in 

front of, persons who are not trained in statistics. 

The test statistic for most nonparametic tests 
can be calculated by hand. For example, a set of 20 pairs 
of data elements (X,Y) can be compared for equality of 
expected value (E(X)=E{Y)) by either the t-test or the 
nonparametric sign test. The t-test needs sample means and 
variances, so some arithmetic operations are required: four 

sums of 20 terms each, 40 squaring of numbers, four divi- 
sions. In contrast, the sign test requires counting in how 
many pairs the X element is greater than the Y element. The 
contrast is not always so one sided; as the amount of infor- 
mation extracted from the data increases, so does complexity 
of computation. For example, rank-order nonparametric tests 



14 



require ranking of data elements, which can become tedious 
in large data sets. However, the process of ranking tends 
to be self-'corr ecting, because an uncanked or misranked dara 
value will stand out in a simple sequential check of the 
ranks. Arithmetic mistakes in squaring, summing, et cetera, 
may not be so obvious unless the operation is redone. When 
computations are made by hand, therefore, non parametric rest 
statisistics are easier to find. 

Since hand-held calculators are even more ubiqui- 
tous than parametric tables, this advantage of nonparamet ric 
test is less important than it once was. If the number of 
keystroke errors is proportional to the number of numbers 
entered in the calculator, though, the simplicity of nonpar- 
ametric tests may still be an advantage. 

Simplicity could also be an advantage in the rare 
case when tables of distribution are not available. Many 
nonparametric test distributions, especially the simpler 
tests, rest on the binomial distribution. For small sample 
sizes, the binomial can be calculated by hand; with a small 
calculator, a distribution for sample size of 20 or 30 can 
easily be calculated. The normal, student's t, chi-square 
and F distributions lack closed forms for their cumulative 
distribution functions. Numerical integration--even if the 
density functions were remembered or available--would be 
challenging with a calculator (since all but the normal 
require one or more gamma functions! and probably is beyond 
the realm of paper and pencil mathematics. 

Simplicity is also a virtue in explaining test 
results. Comparison and rankings are fairly easy to follow 
(although sometimes difficult to perform) , while the arith- 
metic operations of parametric tests are less transparent. 
Most non-statisticians have had experience in tossing coins, 
rolling dice or some games of chance. Analogies can be made 
from those experiences to the evaluation of tesr results. 
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In the previous example of 20 pairs (X,Y) , with the sign 
test producing a test statistic of 4 (i.a., in four pairs, 

X>Y) , an analogy could be made to the odds that a coin were 
fair if it came up heads four times in 20. Explaining that 
same data set with the t-test would require mention of stan- 
dard deviations and pooled variances, and perhaps drawing of 
overlapping bell-shaped curves. An audience predisposed to 
disagree with the test results might be hard to convince 
with parametric tests. 

b. Applicability 

Noaparametric tests can be used on a larger 
number of data set than can parametic tests, because of the 
lack of assumptions about the distribution of the underlying 
population, and the ability to handle ordinal or interval 
data. This advantage is mors vital in the "real world" than 
in academia, for three general reasons: 

(1) An analyst is sometimes handed data sets to 
do something with; for any one of a number of reasons, para- 
metric tests may not applicable. Host commonly., the form in 
which the data was collected in may fail to meet the under- 
lying requirements of para metrics, but meets the require- 
ments of nonparametr ics. 

(2) The cost of gathering interval data is 
prohibitive, while ordinal or nominal data can be gathered 
at low cost. For example, on a field exercise, uncalibrated 
eyes can generally judge distances greater than or less 
than, but some sort of instrumentation must be used to 
determine the exact magnitude of the difference. The expen- 
sively equipped national test ranges are a response to the 
need for very accurate interval level of measurement data. 
Laboratories, too, are often equipped for fine metrics. But 
in field or operational tests, ordinal level of measurement 
may be the highest level available. 
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(3) The experiment was designed to gather 

interval data, from populations with common variance, but 
something went wrong. Instrumentation sometimes fails. 
Even if the measurement is accurate, variances may not be 
common or the data may not be transformable to look normal. 
Rather than giving up, or repeating the experiment, nonpar- 
ametric tests may answer some questions originally to be 
answered by analysis of variance or regression. 
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III. THE COMPUTER PACKAGE 



A. GENERAL 

This section gives a brief description of the computer 
package. Tests are described in sections V through XXVIII, 
while the Appendices describe the programs and files in 
detail. 

The computer package consists of two computer programs, 
Lochinvar and Crunch. Also involved are three types of CMS 
files: data files and an options file, written by Lochinvar 

to be read by Crunch; and output files, written by Crunch, 
containing test results. 

B. LOCHINVAR 

The program Lochinvar (LOad CHoioes and INput VARiables) 
is the user's main contact with the programs. It is inter- 
active, prompting and screening the user's entries. Its 
purpose is to writs two types of files needed by the program 
Crunch: the data file, and the option file, 

1 . Data Entry 

Lochinvar allows the user to input data in two sepa- 
rate ways. The user must decide which one to use before 
starting the program. 

a. Method I: Data Values into a Data File 

Whan the data is numerical, the user should 
strongly consider this method. Lochinvar prompts entry of 
data set and variable names, and of the data set values 
themselves. All that information is sent to a file on the 
user's A disk; the user chooses one of six files, "lochi 
a" , . . . , "lochi f, into which the data is written. Unless 
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the data will be available for 



erased or over- written, 
repeated use. 

All 24 of the data tests in this text can compute 
test statistics from data files. However, a data file 
itself is not enough to perforin a tesr.; the option file is 
needed to reference the data file and communicate the choice 
of test and test parameters to Crunch. After a data file is 
written, Lochinvar prompts the writing of the options file, 
as described in subsection 2, below. 

b. Method II: Cell Counts in the Options File 

When the data is non -numerical , this is the user's 
only oprion; it may be more convenient even if the data is 
numerical. Four of the n onparametr ic tests handle nominal 
data --binomial , McNemar, chi-square test for independence, 
and Cochran. Instead of entering all the data values, the 
user may run one of these tests by entering the appropriate 
count of elements which fall into a test's subcategories. 
For example, the binomial test can be run on a large data 
set by simply entering two numbers: the count of "successes” 
and the count of "failures" . 

In this method of data entry, Lochinvar prompts 
entry of cell counts at the same time it is prompting the 
test selection and choice of parameter. All that informa- 

tion is written to the options file; no data file is 
written. 

2* The Options F ile 

The purpose of the options file is to tell Crunch 
everything it needs to hnow to perform a nonparametric test. 
In all cases, the options file contains the user's selection 
of which test to be performed, the parameters to be used in 
that test, and to which CMS output file the output should be 
written. If the data values are to be used (Method I) , the 
cption file states which data file Crunch should read. If 
cell counts are used (Method II) , the options file contains 
those counts, as well as the data set name. 
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The options file is described in detail, with an 
example, in Appendix D. 

3 • Dsinq Lochin var 

In CMS, type "LOCHI". The program will display a 
sequence of questions; the user's response will depend on 
the method of data enty and test selection, 
a. Use of Previously Enter Data 

The program asks "do you wish to perform a test 
on a previously entered data set?", i.e., one of the six CMS 
data files "lochi a" ,"lochi f". 

If the answer is yes, then Lochinvar asks which 
data set to use. The program reads the data set's name, and 
displays it to the user, and asks "Is this the right data 
set?" If yes, the user indicates so, and the test selection 
is next. If no, the the user must choose another file. 

If the answer to the question of previously enter 
data is no, then Lochinvar asks wherher data entry is by 
data values (Method I) or cell counts (Method II) . If the 
user wants to enter cell counts, the next step is to select 
a test. If the user wants to enter a data set, the program 
asks : 

(1) To which file the input data is to be written 

(2) What is the data set name, number of vari- 
ables and variable names. 

The program will then prompt entry of data 
values. If there are two variables in the data set, the 
user will be offered the choice of entering data pair by 
pair; this will only work if there are an equal number of 
elements in each of the two variables. Normally, all the 
data elements of the first variable will be entered 
together, then all of the second variable, et cetera. After 
entry of each variable, Lochinvar will offer the user the 
chance to check, change or add data elements to the values 
just entered. 
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C. CRUNCH 



1 • E xecuti ng the Non pa r ametri c rest 

After the options file has been wrirren, the user 
executes the test by typing "CRUNCH” while in CMS. 

2 • Actions of the Pr ogr am Crunc h 

Crunch reads the options file, and when necessary, 
the designated data file. It performs the nonparametric 

test requested, and reminds the user with a display on the 
screen to which output file the results have been writren. 
Crunch actually writes three files: 

a. The Output File 

The output files are named "u listing", "z 

listing". At the top of of the file is the label, 

consisting of the name of the data set, the number of vari- 
ables in the data set, the number of variables used in the 
test (if less than the number in the whole data set) , and 
the names of the variables used in the test. Following the 
label is the name of the test performed. The rest of the 
output file's format varies, depending on the test chosen. 
In general, the output file contains only the most salient 
results of the computations. All output files contain 

either a test statistic, or a test statistic and the empir- 
ical level of significance for the desired null hypothesis. 

b. The Problem Listing File 

If there are any problems encountered by the 
computer that result in Fortran error or warning messages, 
such messages will be written to the CMS file "problem 
listing" . 

c. The Nptest Listing File 

This file contains intermediate computations of 
the program as it works towards the test statistic. For 
example, the Smirncv test requires finding differences in 
probabilities of two empirical distributions. Since the 
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test statistic is simply the largest absolute difference, 
the output file will only list the largest positive and most 
negative difference, as well as the test statistic. "Nptest 
listing", among other things, will include each difference 
between the empirical cumulative distribution functions 
(cdf's). This listing file is not arranged to be attractive 
or suitable for submission, but it is labelled, and may be 
of some use in understanding the operations of Crunch. 

D. ADDITIONAL FEATORES OF THE PACKAGE 

To avoid overwriting input or output files, six of each 
were provided. If more are needed, the user can rename some 
of the existing files. 

Lochinvar is a convenient way to enter data by hand. 
When data is generated from other programs or read from dish 
or tape, the user may want to create his own data file 
rather than use Lochinvar. Detailed instructions for this 
are contained in Appendix C. 

Crunch is written in Fortran. Users who are experienced 
programmers nay want to modify it to suit their own tastes, 
or adapt portions of it into their own programs. An expla- 
nation of the program is contained in Appendix E. The 
program is listed in Appendix G. 

E. CONSIDERATIONS IN DATA ENTRY 

Lochinvar is written in Pascal, a convenient language 
for interactive programs, but a language which may be unfa- 
miliar to many users. This section provides sufficient 
insight to Lochinvar so that the user can efficiently commu- 
nicate choices to the program. 
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1 . Entry of Nu mbers 

When entering numerical data, Lochinvar requires 
that the first character read is either a numeral or a minus 
sign followed by a numeral. Therefore, to enter a number 

between minus one and plus one, precede the decimal with a 

••O'*. 

When entering more than one number ar a time, leave 
a space (not a comma) between the numbers. 

If Lochinvar expects a number and gets anything 
else, it will halt with an error. This is a convenient way 

to get out of the program quickly, but anything entered on 

that run of Lochinvar will be lost. The one exception is 
when entering a data set of data values and an options file 
in a single run. After the data set is entered, the program 
writes it to the designated file, and asks "now, do a test 
with this data?" Any subsequent errors will not effect the 
data file itself, just erase all additional entries. 

2* Entry of Cha racter s 

when the program prompts for a single letter 
response, e.g. , "enter Y or N", enter "Y" or "N" and press 
the enter key. The program expects and will read only the 
first letter. Single letter entries are contained in 

"repeat. .. until" loops which will not allow the user to 
procede until one of the offered choices is made. 

Do not use the alternate character (apl) setting on 
the keyboard while using Lochinyar. 

When Lochinvar prompts for the data set name or 
variable names, the user must enter from one to 48 charac- 
ters. These names are one of the few things for which the 
program does not offer a chance to correct an error. The 

user must be careful, checking the names prior to hitting 
the "enter" key. If an error is made anyway, the data or 
options file can be corrected using Xedit in CMS. 
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If th= p 
one numeral or c 
in "do while not 
key is pressed 
with an error, 
of the line, 
entries will be 



rogram prompts for an entry, enter at least 
haracter. The interactive read commands are 
eoln " (end of line) loops; if the enter 
with nothing to enter, Lochinvar will halt 
because it attempted to read beyond the end 
As with numerical entry errors, previous 
lost . 



IV. ROSTER OZ AVAILABLE NONPA RAHETRIC TESTS 

A. CRITERIA FOR SBLECTIN3 THE MOST APPROPRIATE TEST 

In selecting a nonparametric test, the user must 
consider three criteria; purpose of the test, data set level 
of measurement, and whether the data elements are related 
(i.e., paired or blocked). 

B. CRITERION I: PORPOSE OF TEST 

Nonparametric tests can be successfully used to test for 
questions of location, inde pendence/interaction, and distri- 
bution. Table 1 shows a matrix of the nonparametric test 
with level of measurement (LOM) oi the horizontal axis and 
general purpose on the vert ical. 

Location usually refers to whether the expected value of 
one population is equal to that of another, based on sample 
variables. Other purposes are testing for proportion of 
successes or for value of a single population's quantile. 
Table 2 lists the location tests with their null hypotheses, 
to help the user narrow the choice set. 

Independence of two samples refer to statistical inde- 
pendence, i.e., correlation of zero. Interaction may be a 
before/after effect, or prediction of one variable by 
another. Table 3 lists these tests by their specific null 
hypotheses. 

Some tests for distribution check for equal variance 
among variables. other tests check a single variable's 
empirical cumulative distribution function (ecdf) against a 
theoretical distribution, with either hypothesized or esri- 
mated parameters. Still other tests compare the empirical 



26 



TABLE 1 

Nonparametric lasts 

Pu rpos e, L evel s of Measure meats: 

Hinimul # — — - — _ 



7arsT ~ 


Interval 


Ordinal 


Nominal 


LOCATION: 








1 var 




Quantile 


Binomial 


2 var 


Wilcox on* 


Sign* 

Mann-Whi tney 




>2 var 


Quade* 


Median 

Friedman* 

Kruskal 

-Wallis 


Cochran 


INDEPENDENCE/ 

INTERACTION; 






1 var 




Cox-Stua rt 


McNemar 


2 var 


Corre lation 
NP Regression 
Monotone 


Cox-stuart * 


R*C Contingency 
(chi-square) * 



Regression 



DISTRIBUTION; 

1 var Kolmogorov (N<31) 

Goodness of Fit 
(chi-square ,N> 14) 

Mil/ces-Shapir o (N<31| 

Lilliefors (N<3i) 

2 var Smirnov 

Cramer-von Mises 

>2 var Squared Rank 

Hartley 

* indicates that variables must be paired or blocked 



cdf ' s of two variables aga 
list of these tests, with t 
urement, and other consider 
The decision matrix ind 
these purposes. The speci 
checked to be sure that the 



inst each other. Table 4 has a 
heir hypotheses, levels of meas- 
ations . 

entifies tests suited to each of 
fic section on the test should be 
test will do what is desired. 
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TABLE 2 










1 


Nonpar ametric Tests for 


Location 






HYPOTHESIS 


NAME # VARS 


LOM 


SEC 


PAGE 




P (success) =P ♦ 


Binomia 1 


1 


N 


V 


30 




P (X<x*) =P* 


Quantil e 


1 


0 


VI 


35 




E (X) =E(Y) 


Sign . . 


2* 


0 


IX 


47 




Mann-Wh itney 


2 


0 


VII 


40 






Milcoxon 


2 * 


I 


XI 


65 




E (H) =.. .=E (Z) 


Cochran 


> 2 * 


N 


XXIII 108 




Medi an 


>2 


0 


XVII 


83 






Kruskal-Hallis 


>2 


0 


XVIII 88 






Friedma n 


> 2 * 


0 


XXI 


99 






Quade 


> 2 * 


I 


XXII 


104 




♦ indicates that variables must 


be 


paired or 


blocked 








TABLE 3 












Nonpar ametric 


Tests for Inda pendsnce/Interaction 




HYPOTHESIS 


NAME # VARS 


LOM 


SEC 


PAGE 




Trend (V. order) 


Cox- Stuart 


1 


0 


VII 


43 




Trend (X v. Y) 


Cox- Stuart 


2 * 


0 


VII 


43 




Before/After 


McNemar 


2 * 


N 


X 


51 




rho(X,Y)=0 ^ 


R*C Contingency 
chi-square test 


2 * 


N 


XI 


56 






for indepencence) 
NP Correlations) 


2 


0 


XIV 


69 




E (Y1 X) =a+b*X 


NP Regression 


> 2 * 


I 


XV 


74 




b#0 


Monotone Regress 


>2 


I 


XVI 


79 




* indicates that variables must 


be 


paired or 


blocked 




# gives pearson's and Spearman's 


rho, Kendall ' 


s Tau 


— J 
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TABLE 4 

Nonparametric Tests for Location 



T 



HYPOTHESIS 



NAME 



# VARS 



LOM SEC 



PAGE 



var (W) = 

. . .=var(Z) 

Fx(X) =G*.(x) 
(specified 
parameters) 

Fx(X) =G«(X) 
(esrimated 
paramete rs) 



Fx (X) =GX (X) 
F,G: 

empirical cdfs 



Hartley >2 I 

Squared Rank >2 I 

Kolmogo rov (N<3 1) # 1 I 

Goodness or Fir# 1 I 

(chisguare,N>14) 

Lilliefors (N<31) 1 I 

(norma 1, exponent ial) 
Goodness of* Fir# 1 I 

Wilkes- Shapiro 1 I 

(normal,N<31) 

Smirnov n 2 0 

Cramer-von Mises 2 0 



XX 

XIX 

XXIV 

XX 



XXV 

XII 

XXVI 



96 

92 

113 
6 1 



1 1 8 

6 1 
121 



XXVII 124 
XXVIII 127 



# tests normal, uniform /exponential, weibull, erlang 



C. CRITERION II: LEVELS OF MEASUREMENT 

Levels of Measurement were treated in section II. 

D. CRITERION III: RELATIONSHIP AMONG DATA ELEMENTS 

Data set variables may be unrelated, or paired (if 2 
variables) or blocked (if more than 2 variables) . When 
unrelated, the order of variables in the first variable is 
not dependent on the order of of all other variables; e.g. , 
the data elements of a variable could be entered in any 
order. In data sets with unrelated variables, the number of 
elements do not have to be equal. 

Paired or blocked data sens require sample sizes to be 
equal and the order of the elements of the variables 
matters. Examples of pairing mighr be responses of husbands 
and wives, the intelligence of twins, the condition of rhe 
same experimental subject before and after the experimental 
treatment. 
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E. SELECTING THE BEST TEST 

As mentioaed in section 
tests applied to the same 
different results to the sa 
the sign test and the wilco 
given in section II--that t 
information from the data- 
decision matrix. In genera 
etric test is one that tes 
uses all the available inf 
Using all the available inf 
which requires the level o 
using a paired or blocked t 

These rules require th 
might want to know if pro 
than procedure Y. At fir 
location; isE(X)=E(Y)? 
ered a problem of distribut 
for distribution would ch 
across the entire range of 
of the information of the 
with small data sets. 

The computer package, d 
makes running multiple tes 
easy. The user can explor 
tests on the same data, to 



II, zvo different nonparametric 
data set could conceivably give 
me hypothesis. The example was 
xon signed rank test. The reason 
hose two tests extract different 
-underscores the reason for the 
1, the most appropriate nonparam- 
ts exactly what is desired, and 
ormation contained in the data, 
ormation implies selecting a test 
f measurement of the data, and 
est if appropriate, 
ought. For example, the user 
cedure X produces better results 
St glance, that is a problem of 
However, it could also be consid- 
ion: is Fx(z)=Fy(z)? The test 

eck the empirical distributions 
observations, and thus use more 
data set, particularly important 

escribed in the previous section, 
ts on the same data set fairly 
e the results of using different 
get insight into his problem. 
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7. TEST 



A. OVERVIEW 

1 . P urpos e 

To characterize p, a population's proportion of 
"successes", based on the sample. 

2. Level of Mea s ure me n t 
Nominal 

3. A ssump tions 

The sample is randomly drawn from the population. 
Data may be divided into any dicotoaous groupings, i. e. , 
two mutually exclusive and collectively exhaustive groups. 
Here, the groups are called "successes" and "failures". 

4 . H vpoth es es 

Let p* be a specified number 0<p*<=1. 

a. Two tailed test 

(1) Null hypothesis hD: p=p* 

(2) Alternative hypothesis hi: p^^p* 

b. One tailed test 

(1) Null hypothesis hD; p^p* 

(2) Alternative hypothesis hi; p>p* 

c. One tailed tesr 

(1) Null hypothesis hD: p>p* 

(2) Alternative hypothesis hi: p<p* 



B. ACTION REQOIHED EY OSER OF PROSRAM 

1 • De cis io ns about P rog raa Opti ons 

The user specifies which treatment of the data set 
is used for the test, a hypothesized proportion of successes 
(p*) , and the desired level of significance (alpha*) . The 
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program uses the count of successes and failures; there are 
three ways to indicate that dichotomization: 

a. The user enters cell counts. This is the only way 
to handle ncn-numerical data, but will work for any data. 

b. The user dichotomizes the data set by a user 
specified list. This is the way to handle numerical, 
nominal data. Data values which are elements of the speci- 
fied list are in one set; the resr of the data values are in 
the other set. The user indicates whether the list defines 
success or failure. 

c. The data set is divided in two by a partition 
value. This is only possible if the data is numerical and 
at least ordinal. All elements above the partition value 
are in one set; the user defines whether that sat is success 
or failure. The values below the partition are the other 
type. 

2- Pro gra m Acti vat ion w it h Cell Counts ( cho ice a a bo ve ) 

a. Initiation and test selection 

In CMS, type LOCHI. Enter ”N" for no to question 
about previously entered data. Enter "C* for cell counts. 
Enter "0" ," V", . . .”Z" for desired output file. When offered a 
choice of tests, enter 1. 

b. Cell count entry 

Lochinvar will ask the number of successes and 
number of failur es--enter those cell counts. 

c. Parameter entry 

Lochinvar will ask for a hypothesized proportion 
of successes P*, and a desired level of significance, 
alpha*. When prompted, enter those numbers 

3. P roqra m Act i vati on wit h Da^a Values (choice b or c) 

a. Initial Choices 

See section III.C for a detailed explanation of 
the following. 
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In CMS, type LOCHI. If data is already in a data 
file, enter "T” to first question, then enter file name; 
when offered a test, type 1. If data needs to be entered, 
enter "N" for first question, "V" for second; when data 
entered, - choose to do a test; when offered a choice of 
tests, type 1. Enter " 0", "7",. . . "Z” for desired output 
file. 

b. Variable and Parameter Choice 

When prompted by Lochinvar; 

(1) Enter index of variable to be used in test. 
Enter whether dichotomizing by list or by partition value 

(2) If by list, enter the number of elements in 
that list. Enter elements of list. Enter ”Y", when 
prompted if elements on that list represent success. 

(3) If by partition value, enter that value. 
Enter ”Y" if data elements below that value are successes. 

• Te st E x ecu t i cn 

To perform the test , after the above entry of data 
and parameters, type CRUNCH while in CMS. 

5. O utput A vailable to the User 

The program CRUNCH displays the data file to which 
the data has been sent, e.g., "U LISTING". The names of the 
data set and the variable used in the test are at the top of 
the output file. The empirical level of significance, 
alpha-hat, and confidence interval limits, are listed. 

C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 

1 . Descr i pti on of Cal culat ion of Level of Significance 
an d Int ervals 

a. Count number of successes. Divide by the total 
number of elements to give p-hat. 

b. If N<20, generate a binomial probability mass 
function with parameters N and p-hat. Otherwise, use the 
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with parameters 



normal approximation to the binomial, 

N*p-hat and N*p-hat* (1 - p-hat). 

c. Compute tail probabilities 

(1) The upper tail probability is the likelihood 
that the number of successes is greater than or equal to the 
sample number of successes. 

(2) The lower tail probabilty is the likelihood ■ 
that the number of successes is less than or equal to the 
sample number of successes. 

d. Compute the level of signficance, alpha-hat 

(1) if hO: p^p* , alpha-hat equals the upper tail 

probability. 

(2) if hO: pip*, alpha-hat equals the lower tail 

probabiltiy 

(3) if hO; p=p*, alpha-hat equals two times the 
minimum of the upper and lower tail probabilities 

e. Compute the interval, N less than 21 

(1) Let y1 = number of successes minus one, = 

index of the pmf of the lower tail probabilty. Vary p*, 
0<p*<=p-hat, until the lower tail probability equals 

alpha*/2. ?* is the lower confidence limit. 

(2) Let y2 = number of successes plus one, 

=index of the pmf of the upper tail probability. Vary p*, 
p-hat<p*<=1, until the upper tail probability equals 

alpha*/2. This p* is the upper confidence limit. 

f. Compute the interval, N greater than 20 

(1) Compute the standard error of estimate by 
taking the square root of the quantity (number of successes 
minus the number of failures, divided by N to the third). 

(2) Find the value z, which is the desired quan- 
tile, l-alpha*/2, of the standard normal disrribution 

(3) The lower confidence limit is p-hat minus z 
times the standard error. The upper confidence limit is 
p-hat plus z times the standard error. 
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2* Program Ope ra tions 

a. Determine the cell count 

b. If N is less than or equal to 20 

(1) Generate a vector of N choose k, k=0,1,...,N. 

(2) Generate a binomial pmf, with parameters N 
and p-hat=number of successes divided by N. 

(3) Compute the tail probabilities 

(4) Compute the empirical level of significance 

(5) Using the previously generated N choose k 
vector, generate partial binomial pmf's, with terms 0 to 
number of successes minus one, or number of successes plus 
one to N. A bisection search technique varies p, 0<p<=p-hat 
or p-hat<=p<=1, until the sum of the terms of the partial 
pmf is within 0.0001 of the desired level of significance 
(alpha*/2, because the interval is two tailed). 

c. If the sample contains more than twenty elements 

Use the normal approximation to the binomial. 
The subroutine Muvar uses numerical integration to find tail 
probabilities; from which the empirical level of signifi- 
cance is estimated. Muvar also funishes the z quantile for 

computing the confidence interval limits. 

3 • P rogra m Char acte ri s tie s 

a. Limitations 

Maximum number of data points; 400, if the 
program computes the cell count from a data list; unlimited 
if the user enters the cell counts directly. Maximum number 
of variables: 1. 

b. Subroutines/Procedures Used 

(1) Lochinvar: Binome , Which_vec,Hyp 

(2) Crunch: Bintest; Muvar, Cibi21 or Nchuzk, 

Binpmf, Cibi20, Bisex 

(3) External: Sh sort 
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VI. 22MTILE TEST 



A. OVERVIEW 

• G enera l Purp ose 

The quantile test addresses the question of what 
proportion of the population is less than or equal to a 
specified value. An example would be that a student might 
be concerned less with the average grade on a test than the 
proportion of students who received at least the minimum 
passing grade. 

2* Level of Measurement 

Ordinal 

3. A ssump ti on 

The data set is a random sample from the population. 

4 . H ypoth eses 

Here, p* is the specified population proportion, 
0<p*<=1, and X* is a specified value wirhin the range of the 
population. 

a. Two tailed test 

(1) Null hypothesis hO: 
the p*th quantile = x* 

(2) Alternative hypothesis hi: 
the p*th quantile x* 

b. One tailed test 

(1) Null hypothesis hO: 
the p*th quantile > x* 

(2) Alternative hypothesis hi: 
the p*th quantile < x* 

c. One tailed test 

(1) Null hypothesis h3: 

the p*th quantile < x* 
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(2) Alternative hypothesis hi: 
the p*th quantile > x* 

B. ACTION REQUIRED BY USER OF PROGRAM 

1 • U ser Decisio ns abou t P rog ram Op tio ns 

The user must specify the values for p* and x*, and 
the letter of the hypothesis desired. 

If there is more than one variable in the designated 
data set, the user must specify which is to be used. 

2. P rogra m Activation when Using Data Values 

a. Initial Choices 

See section III.C for a detailed explanation of 
the following. 

In CMS, type LOCHI. If data is already in a data 
file, enter ”Y*' to first question, then enter fils name; 
when offered a test, type 2. If data needs to be entered, 
enter "N” for first question, "V” for second; when data 
entered, choose to do a test; when offered a choice of 
tests, type 2. Enter " U", "V”, . . . ”Z" for desired output 
file . 

b. Variable and Parameter Choice 

When prompted by Lochinvar: 

(1) Enter index of variables to be used in test, 
if there is more than one in the data set. 

(2) Enter the values of p*, x* , the letter of 
the chosen hypothesis, and the hypothesized level of signif- 
icance, alpha*. 

3 • Tes t Ex ecu tion 

Tc perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 

** • Output Avail able to the User 

The program CRUNCH displays the data file to which 
the data has been sent, e.g., "U LISTING". The names of the 
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data set and variables used are at the top of the output 
file. Also included are the test statistic (the number of 
data elements less than or equal to x*) , the level of 
significance, and confidence limits for the population's 
p*th quantile. 

C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 

1 . Calcu lation of Level of Signifi ca nce and Conf i denc e 
Limi ts 

a. Compute the Test Statistic 

Crunch compare the data elements with x*, 
counting the number which are less than or equal to x*. If 
there are no ties, T1 = T2 = the count. If some elements 
are equal to x*, then T1 = the count of data elements less 
than X*, and T2 = T1 plus the number of ties. 

b. Computation of the Level of Significance 

If T1 = T2, the computation of the level of 

significance is exactly the same as the binomial test, with 
T1 as the number of successes, the total number of data 
elements as n, and p* as the p* of the binomial test. If 
T1 < T2, then the lower tail probability is calculated using 
T1 ; the upper tail probability uses T2; and the computation 
procedes as with a normal binomial test. See section V for 
details about computation of level of significance in the 
binomial test. 

c. Find the Confidence Limits 

The data elements are ranked; the ranked values 
are called "order statistics", so that X(1) is the smallest 
value, X (k) is the largest values ia a data set of k values. 
The confidence limits are found by computing the values r 
and s, which are indices of the order statistics; once they 
are found, X(r) is the lower limit, X (s) is the larger 
limit . 
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(1) with less than 21 data elements, a binomial 
pmf is generated, with parameters k = number of data 
elements, and p = p*. R is found by summing terms of the 
pmf from the 0th term (P(N=0)) until the first term which 
causes the sum to be greater than the half of the hypothezed 
level of significance (alpha*/2) . The index of that term is 
r. S is found by summing terms of the pmf from the kth term 
(P(N=k)) until the first term which causes the sum to be 
greater than alpha*/2. The index of that term is S. 

This procedure is conservative on both bounds, and 
therefore may be very conservative overall. For example, 
with k = 18, p* = 0.5, and alpha*= 0. 15, Crunch would choose 

r of 5 and s of 13; the probability that in the population 

an X would fall between X(5) and X(13) is 0.904, while the 

user would have been satisfied with about 0.85. With the 

binomial tables, the user might choose indices 6 and 13, or 
5 and 13, which would have only 0.0481 in one tail but 
0.1151 in the other, so that the test would have almost 
exactly the alpha desired. 

(2) With more than 20 data elements (=k) , the 
hypothesized variance is k(p=»“(1-p*)); its square root is the 
sample standard deviation. The expected value is kp^. R is 
the expected value plus the quantity standard deviation 
times the (alphaV2) quantile of the standard normal distri- 
bution. If r is not an integer, round up. s is the 
expected value plus the quantity standard devation times the 
(1-alpha*/2) quantile of the standard normal distribution. 
Again, round up if necessary to make s an integer. 

Pro gra m Method of 0 perat ion 

The subroutine celler calculates T1 and T2 by 
calling the subroutine dicot, which takes the values of the 
data set and find the number less than and the number less 
than or equal to the specified x*. The subroutine quantile 
receives that information, then calls the subroutine bintst 
to perform the binomial test described above. 
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Quantl then calls 
generate a binomial pmf to 
indices r and s, if the 
less. Otherwise, Quantl 
values of the { alpha */2) a 
normal distribution to use 
The data values are sorted 
X(r) and X(s) values derefe 
3 • P roqra m Cha r acteri s 



vari 



Nchu 



a. 



able is 

b. 



zk. Bin 



Limitations 
Maximum number 
used . 



Subroutines/P roc 


(1) 


Loch invar 


: Q 


(2) 


Crunch: 


Qua 


pmf 


or Normv 




(3) 


External: 


Sh 



subroutines Nchuzk and Binpmf to 
calculate the confidence limit 
total number of values is 20 or 
calls the subroutine Normv for 
nd (1-alpha*/2) quantile of the 
in the calculation of r and s. 
by the subroutine RNQ, and the 
renced as the confidence limits. 
tic s 

of data points: UOO. Only one 

edures Used 
uantile 

ntl. Cellar, Dicot, Bintst; Rnq, 
sort 
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VII. MANN WHITNEY TEST 



A. OVERVIEW 

^ • Genera l Purp ose 

With two variablss, tast whether tha populati 
from which tha variables ware drawn have the same mean. 

2 . Level of Meas urame n t 
Ordinal 

3 . A ssump tions 

a. Each variable is a random sample. 

b. There is mutual indapendance between samples. 

4. H ypoth eses 

a. Two tailed test 

(1) Null hypothesis hO: E(X)=E(Y) 

(2) Alternative hypothesis hi: E(X)?‘E(Y) 

b. One tailed tast 

(1) Null hypothesis hd: S(X)>E(Y) 

(2) Alternative hypothesis hi: E(X)<E(Y) 

c. One tailed test 

(1) Null hypothesis h3: E(X)<E(Y) 

(2) Alternative hypothesis hi: E (X) >E (Y) 



B. ACTION REQUIRED HY USER OF PROGRAM 

1 • User Uscisio ns about Program Op ti o ns 

The user must decide which two of the variables 
use in the test if the data set contains more than two. 
user specifies the level of significance. 

2 • Progra m A c t i vat i o n when Using D a ta Va lues 
a. Initial Choices 

Sea section III.C for initial choice details. 



4 1 



ons 



to 

The 



In CMS, type LOCHI. I£ data is already in a data 
file, enter "Y” to first question, then enter file name; 
when offered a test, type 4. If data needs to be entered, 
enter "N" for first question, ”7'' for second; when data 
entered, choose to do a test; whan offered a choice of 
tests, type 4. Enter ” U" , "V , . . . "Z" for desired output 
file. 

b. Variable and Parameter Choice 

When prompted by Lochinvar; 

(1) Enter indices of variables to be used in 

test , 

(2) Enter the level of significance. 

3 • Tes t B x ecu ticn 

Tc perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 

4 . Output Ava il able to the User 

The program CRUNCH displays the output file to which 
the data has been sent, e.g., ”U LISTING”. The names of the 
data set and variables used are at the top of the output 
file. The empirical level of significance and whether to 
reject the null hypothesis are stated. 

C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 

1 . Descr i ption of Cal culation of Test Statis tic/Leve l 
of S ignificanc e 

a. Sums of the Ranks of the First Variable 

Rank the values of the two variables. Sum the 
ranks of each. The test statistic T is the sum of the ranks 
of the first variable. 

b. Overall Test Statistic 

The overall test statistic, if their are many 
ties, is T1. Let n be the number of elements in the first 
variable, m the number of elements in the second variable; N 
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is the sum of n and m. Sum the squared ranks of both vari- 
ables to find R2 . The numerator of the test statistic is T 
minus n(N+1)/2. The denominator is the square root of the 
difference of two terms: R2*nm/N (N- 1 ) , and 

(nm (N + 1) **2) /4 (N-1) . 

2* Progra m Method of 0 perat ion 

The subroutine Manwht calls the nonimsl subroutine 
Utest, to get the level of significance. 

3 • P rogra m Cha r acteri s tics 

a. Limitations 

Maximum number of data points: 400. Maximum 

number of variables: 2. 

b. Subroutines/Proc edures Used 

(1) Lochinvar: Manwhit 

(2) Crunch: Manwht 

(3) External: Ot est 
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VIII. COX-STU ART TEST FOR TREND 



A. OVERVIEW 



1 • G enera l Purp ose 

This is a specialized fora of the sign test, to 
determine if the data elements ii the second half of a 
sample are significantly greater or less than the corre- 
sponding values in the first half. with one variable, the 
order of the data elements determines how the data elements 
are paired for comparison. 

With two variables, in which the data elements are 
paired, the ranks of the independent variable orders the 
values of the dependent. This yields a rough test of the 
interaction or dependence of the two variables. 

2. Level of Measurement 



Ordinal 

3. A ssump tions 

a. The data elements represent random draws 

b. Either there is no trend, or there is an under- 
lying trend upward or downward throughout data. 

4. Hypoth eses 

Where (+) indicates the second data element is 
greater than the first in the pair, otherwise (-) , so that 
P(+) =P(-) indicates no trend, P(<-)>P(-) indicates upward 
trend (1 variable) or vary directly (1st variable to 2d 
variable), and P(+)<P(-) indicates downward trend or vary an 
inverse relationship. 

a. Two tailed test 

(1) Null hypothesis hO: ?( + )=?(-) 

(2) Alternative hypothesis h1: ?(+)^P(-) 

b. One tailed test 

(1) Null hypothesis hO; P(+)<P(-) 
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(2) Alternative hypothesis hi: P (+) >P (-) 
c. One tailed test 

(1) Null hypothesis hO: P( + )>?(-) 

(2) Alternative hypothesis hi; P (+)<?(-) 



B. ACTION REQUIRED BY USER OF PROGRAM 

^ • User Dec is io ns about Pr og raa Options 

If one variable is used, make sure data elements are 
in the order (often order of time) for rhe trend to be 
tested. If two variables are to be used, data elements must 
be paired. The user selects the type of hypothesis and the 
level of significance. If there are two variables, the user 
specifies which is dependent. 

2 • P rogra m Acti vat i o n when Using Dat a Va lu es 

a. Initial Choices 

See section III .C for a detailed explanation of 
the following. 

In CMS, type LOCHI. If data is already in a data 
file, enter ”Y*' to first question, then enter fils name; 
when cffered a test, type 3. If data needs to be entered, 
enter "N" for first question, "V” for second; when data 
entered, choose to do a test; when offered a choice of 
tests, type 3. Enter ” U” , "V , . . . '*Z ” for desired output 
file. 

b. Variable and Parameter Choice 
When prompted by Lochinvar: 

Enter index or indices of variables to be 
Enter pairs to be excluded, if using two 



( 1 ) 

used in test, 
variables. 

( 2 ) 

significance. 

( 2 ) 

deoendent. 



3nter letter of hypothesis type and level of 
If there are two variables, enter which is 
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4 . 



Test E x e cu t ion 

To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CHS. 

5 . O utput A vai lable to the User 

The program CRUNCH displays the output file to which 
the data has been sent, e.g., ”0 LISTING". The names of the 
data seo and variables used are at the top of the output 
file. 

C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 

1 . D escri ption of Calc ulatio n of Level of Sig nificance 

a. Form Pairs (X,Y) Significance 

(1) If there is one variable, call it Y, and go 
to step (2). If two variables, let X be the independent 
variable, and Y be dependent. Find the order of the X. Use 
that ordering to re-order Y in ascending order of the values 
of X. 

(2) Let k be the number of elements in the vari- 
able Y. If K is even then there will be (k/2) pairs, if odd 
then ((k-1)/2) pairs. Let np equal the number of pairs. 

b. Find Test Statistic 

(1) For i = 1 to np, compare Y (i) to Y(np-i+1) . If 
Y(i) > Y(np-i+1) then increment the counter for failures. If 
Y(i) < Y(np-i+1) then increment the counter for successes. 
The test statistic is from the binomial test, with n equal 
the number of successes plus failures, p* equal 0.5, and the 
number of success equal number of times the second half data 
elements are strictly greater than the corresponding first 
half elements. 

2* Progra m ^thcd of 0 perat ion 

a. Ordering the Dependent Variable 

The subroutine Coxstu dereferences the X values 
into the INDEP vector, sorts those values using Shsort, and 
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the indices returned in the vsctor KEY to dereference 
Y values into a vector TLIST. 

If there is only one variable, use a do for loop 
oad its values, in their current order, into TLIST. 

b. Computing the Test Statistic 

The subroutine Coxstu calls the subroutine Signt 
erform the sign test. See section IX, 

3 • P rogra m C h a racteri s tic s 

a. Limitations 

Maximum number of data points; UOO, Maximum 
er of variables: 2, 

b, Subroutines/Procedures Used 

(1) Lochinvar; Coxstu 

(2) Crunch: Coxstu, Signt 

(3) External; Shsort 
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IX. SIGN TEST 



A. OVERVIEW 



1 • G enera l Purp ose 

with pairs of data (Xi,Yi) , determine if the 
expected value of the first element, E(X) , is greater than 
the expected value of the second element, E(Y). 

2. Level of Measurement 

Ordinal within each pair; that is, it can be deter- 
mined that Xi is greater than Yi (+) , or that Yi is greater 
than Xi (-) , ar that they are equal. The values of the pair 
do not have to be ordered with any values outside the pair. 

3. Assump tions 

a. Eaadom Sample 

X,Y are a random sample of the population pairs. 

b. Level of Measurement 

Ordinal within each pair, nominal among pairs. 

c. Invariance of Probability that Xi > Yi 

The underlying population from which the sample 
is drawn is assumed to have a constant P (X>Y) . 

^ • Hypothes is 

Here, (+) is the case that X>Y, (-) is the case that 
Y>X, where X,Y refer to population random variables. 

a. Two tailed test 

(1) Null hypothesis hO; P(+)=P(-) 

(2) Alternative hypothesis hi: ? (+) ^P (-) 

b. One tailed test 

(1) Null hypothesis hO; P(+)<P(-) 

(2) Alternative hypothesis hi: ?(+)>P(-) 

c. One tailed test 

(1) Null hypothesis hO: P(+)>P(-) 

(2) Alternative hypothesis hi: ?(+)<P(-) 
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B. ACTION REaUIRED BY USER OF PROGRAM 



1 • User De cis ions a boa t P rog ram Options 

It is possible that the data is non-numer ical, but 
ordinal within pairs, e.g., a list of winners of arm- 
wrestling contest between two platoons as a measure of arm 
strength. If so, then perform the test as a binomial test, 
entering cell counts, with number of successes as the number 
of times Xi > Yi, n is the total number of untied pairs, and 
p* as 0.5. 

With numerical data, the program will make the 
comparisons. The user must enter the hypothesis letter and 
the hypothesized level of significance. 

If there are more than two variables in the desig- 
nated data set, the user must choose which are included in 
the test. 

2. Progra m Act i vat ion when Using D at a Values 

a. Initial Choices 

See section III .C for a detailed explanation of 
the following. 

In CMS, type LOCHI. If data is already in a data 
file, enter to first question, then enter file- name; 

when offered a test, type 5. If data needs to be entered, 

enter ''N'* for first question, "V" for second; when data 

entered, choose to do a test; when offered a choice of 

tests, type 5. Enter ” U" , •'V , • • • "Z” for desired output 
file . 

b. Variable and Parameter Choice 

When prompted by Lochinvar: 

(1) Enter indices of variables to be used in 
test. Enter blocks or pairs to be excluded, if any. 

(2) Enter rhe letter of the hypothesis type and 
the hypothesized level of significance, alpha*. 
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3 . Test Bx ecu t ion 

Tc perform the test, after the above entry of data 
and parameters, type CRONCH while in CMS. 

4 . O utput Ava il able to ^e U se r 

The program CRUNCH displays the output file to which 
the data has been sent, e.g,, '*0 LISTING". The names of the 
data set and variables used are at the top of the output 
file. The test statistic and the level of significance are 
printed, as well as whether to reject the null hypothesis. 

C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 

1 . D escript io n of Cal c ulation of Level of Sig nificance 

a. Calculation of the Test Statistic 

For each pair (Xi,Yi) , subtract Yi from Xi. 
Count the number of times the difference is greater than 0: 
this is the test statistic. The total number of untied 
pairs is the count of times the .difference is not zero. 

b. Calculation of the Level of Significance 

The level of significance is calculated by the 
binomial test, with the number of successes equal to the 
test statistic, the total sample size equal to the number of 
untied pairs, and the hypothesized proportion of success p* 
equal to 0.5. 

See section V for details about computation of 
levels of significance by the binomial test. 

2* P rogra m Method of 0 peration 

The subroutine Signt compares the values of the 
lower indexed variable with the corresponding values of the 
other variable. Values within epsilon (=0.0031 by default) 
are considered tied. If not tied, then the program checks 
whether Xi>Yi or Xi<Yi, and increments the appropriate 
count, (+) or (-) . Signt then calls the subroutine Bintst 
to compute the level of significance. 
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Maximum 



3 . 



number 



P rogra m Char acteri s tic s 

a. Limitations 

Maximum number of data points: 400. 

of pairs: 200. Required number of variables: 

b. Subroutines/Procedures Used 

(1) Lochinvar; Sign test 

(2) Crunch: Sgnt 

(3) External; Shsort 



2 . 



i 



51 



X. MCNEMAR TEST FOE S IGN IFICANCE OF CHANGE 

A. OVERVIEW 

• G enera l Pu rp ose 

With data in pairs (Xi,Yi), and x and y values each 
classifiable as successes or failures, to determine if a 
significant change has occurred. 

2* Level of Mea sure me n t 
Nominal 

3 . A ssump tions 

a. The pairs are a random sample. 

b. X and Y can be divided into mutually exclusive 
and collectively exhaustive groups: success and failure. 

c. Each of the k pairs caa be categorized intc one 

of four cells: success/success success/failure (1,0), 

failure/success (0,1), and failure/f ailure (0,0). 

4. H ypoth esi s 

a. Null hypothesis 

HO: P (0, 1)=P(1,0) for all i 

b. Alternative hypothesis 

HI: P (0, 1) *P (1,0) for all i 

B. ACTION REQOIRED EY OSER OF PROGRAM 

• D ser De cis io ns about P rog ram Optio ns 

If the data is nominal, the user must enter the 
counts of each of the four categories. 

If the data is ordinal or above, the user may enter 
cell counts, or may specify partition values to dichotomize 
X and Y. Lochinvar will ask for the partition value for X, 
and whether successes are less than it. For example, if 
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17.5 is the partition value for X, and successes are greater 
than it, any X value greater than 17.5 will be a success; 
depending on its Y value, the pair will be counted in the 
success/success (1,1) cell, or success/failure (1,0) cell. 
After asking for the X partition value, Lochinvar will ask 
for the Y partition value and whether successes lie above or 
below . 

If there are more than two variables in the desig- 
nated data set, Lochinvar will ask which two are to be used 
for the test. 

2« Pro gra m Acti vation when U sing Cell Co unts 

a. Initiation and test selection 

In CMS, type LOCHI. Enter '*N" for no to question 
about previously entered data. Enter "C" for cell counts. 
Enter "U” . . ."Z ” for desired output file. When offered a 
choice of tests, enter 6. 

b. Cell count entry 

Lochinvar will ask the cell counts for each of 
the four categories; success/success, succass/failure, 
failure/success, and failure/failure. 

Enter the hypothesized level of significance. 

P rog ram Ac ti v ation when Using Data Values 

a. Initial Choices 

See section III .C for a derailed explanation of 
the following. 

In CMS, type LOCHI. If dara is already in a data 
file, enter "Y” to first question, then enter file name; 
when offered a test, type 6. If data needs to be entered, 
enter for first question, "V" for second; when data 

entered, choose to do a test; when offered a choice of 
tests, type 6. Enter " U'*, "V”,. . . '*Z'* for desired output 
file. 
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b. Variable and Parameter Choice 

When prompted by Lochinvar: 

(1) Enter indices of the two variables to be 
used in test. Enter blocks or pairs to be excluded, if any. 

(2) Enter the partition value for the lower 
indexed variable and whether successes are less than that 
value. Enter the partition value for the variable with the 
higher index; indicate whether successes are less than that 
value . 

(3) Enter the hypothesized level of significance. 

• T es t Exe cut ion 

To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 

5 . Out put A vail able to th e User 

The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING”. The names of the 
data sen and variables used are at the top of the output 
file . 

If there are 20 or less pairs in the data set which 
are classified as (1,0) or (0,1), than the test statistic is 
T2=number of pairs classified as (1,0); that test statistic 
and level of siginif icance will be listed. Otherwise, the 
test statistic T1, explained below, and the level of signif- 
icance will be given. 

In either case, the program will compare the empir- 
ical level of significance with the hypothesised, and state 
whether the null hypothesis is rejected. 

C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 

^ • Descri ption of Calc ulatio n of Level of Sign i ficance 
a. Computation of Cell Counts 

If cell counts were provided by the user, go to 
step b or c. If data values were used. Crunch examines each 
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pair (Xi,Yi) to see the relation =ach data element has to 
its corresponding partition value. Crunch decides which 
cell the pair belongs to, ( 1 , 1) , . . . (0 ,0) , and increments the 
count for that cell. 

b. If Count for Cells (1,0) and (0,1) less than 21 

The test statistic is T2, which equals the count 
of cell (1,0). The binomial test is invoked to find the 
level of significance. T2 is the number of successes; the 
total count of cells (1,0) and (0,1) as n, rhe total sample 
size; the hypothesized proportion of success, p* is 0.5. 
See section V for details of compuration. 

c. If Count for Cells (1,0) and (0,1) exceeds 20 

The test statistic is T1 , equal to the squared, 
difference between the cell counts (1,0) and (0,1), divided 
by their sum. This test statistic is compared to the chi- 
square distribution with one degree of freedom to find the 
level of significance. 

2. Pro qra m Method of 0 peration 

a. Cell Count Calculation 

The subroutine Cellar compares each of the lower 
indexed values to the first partition value, then each of 
the other variable’s values to the second partition value, 
at each comparison, a "1” is assigned to an integer vector 
if the value is a success, otherwise a "0" is assigned. The 
resulting integer vector, with the same number of elements 
as the original data set, is then used to find cell counts. 
With k pairs, the first through the kth elements of the 
integer vector are multiplied by two, then added to the 
corresponding k+lst through 2kth elements. The resulting k 
element vector has values 0,1, 2, 3, corresponding to the four 
cells. The vector is then used to compure cell counts. 

b. Calculation of the Test Statistic 

Crunch figures if the cell counts of (1,0) and 
(0,1) total more than 20. If so, T1 is computed, otherwise 
T2 is used. 
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c. Calculation of the Level of Significance 

If T1 is used, the subroutine mcnemar performs 
linear interpolation of tabled values of the chi-square 
distribution with one degree of freedom. If T2 is used, the 
binomial distribution subroutine Bintst is used to calculate 
the level of significance. 

3 . P rogra m Char acteri s tic s 

a. Limitations 

Maximum number of data points; 400. Maximum 
number of pairs: 200 if program computes cell counts; no 

limit if the user enters cell counts. 

b. Subroutines/Procedures Used 

(1) Lochinvar: Cel_test, McNemar 

(2) Crunch: McNemr, Cellar, McCount 

(3) External: Shsort 
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XI. CHI-SOJ ARS TEST FOR (R*C CONTINGENCY) 

A. OVERVIEW 

1 . G enera 1 Purp ose 

Osed with a data set oDmposed of paired data 
elements, to test whether the classification or value of the 
first element of the pair is statistically independent of 
the classification or value of the second. 

2 • Level of Mea sure men t 
Nomin al 

3 . A ssump tions 

a. The pairs is randomly drawn from the population. 

b. The first element of eaoh pair can be placed in 
exactly one of r categories (rows), and the second element 
of each pair can be placed in exartly one of c categories 
(columns) . 

• Hvpoth es is 

a. Null hypothesis 

HO: P(pair is in row i and col j) equals P(first 

element in row i) times P(second element in row q) 

b. Alternative hypothesis 

HI: P(pair is in row i and col j) does not equal 

P(first element in row i) times P(seoond element in row j) 

B. ACTION RE30IRED BY USSR OF PROGRAM 

^ • Use r Decisions about Pr og ram Options 

a. Entry of cell counts 

If the data is nominal, the user must decide the 
categories of the rows and columns, and enrer the observed 

cell counts, Oij, for each cell. The user may use this 

technique for data of a higher level of measurement. 
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b. Entry of fixed row and column totals 

If the data is ordinal or above, and the categor- 
ization of the data points is by their order (e.g., not by 
whether their last digit is odd), the user may enter Hi’s 
and Cj's. The sums of the Hi’s and Cj's must exactly equal 
the number of pairs in the data set. 

c. Entry of row and column partition values 

If the data is ordinal or above, and the categor- 
ization is based on order, the user can specify (r-1) values 
to divide the data elements into r rows, and (c-1) values to 
divide the data elements into c columns. For example, a 
data pair would fall into cell (1,2) if the first element of 
the pair were less than or equal zo the first row partition 
value, and the second element of the pair were greater rhan 
the first column partiton value but less than or equal to 
the second. 

2* P rogra m Acti vati on when Osing Cell Counts ( choice a) 

a. Initiation and test selection 

In CMS, type LOCHI. Enter '*N” for no to question 
about previously entered data. Enter "C* for cell counts. 

Enter . . ."Z” for desired output file. When offered a 

choice of tests, enter 7. 

/ 

b. Cell count entry 

Lochinvar will ask the number of rows and 

columns. It will then prompt entry of the observed cell 

count for each cell. 

3 • P rogra m Acti vat ion Using Data Values (choice b or c) 
a. Initial Choices 

See section III.C for a detailed explanation of 
the following. 

In CMS, type LOCHI. If data is already in a data 
file, enter "Y" to first question, then enter file name; 
when offered a test, type 7. If data needs to be entered, 
enter ’’N” for first question, ••V" for second; enter the 
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data; when offered a choice of tests, type 7. Enter 
"U” , ” V" , . , . "Z" for desired output file. 

b. Variable and Parameter Choice 

When prompted by Lochinvar: 

(1) Enter indices of the two variables to be 
used in test. Enter the number of rows and the number of 
columns. Enter whether using fixed row and column totals or 
partition values. 

(2) If by fixed totals, enter the row totals 
when prompted, then the column totals. 

(3) If by partition value, enter the row parti- 
tion values when prompted, then the column partition values. 

'* • Tes t Ex ecu tion 

To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 

5 . O utput Ava i lable to the User 

The program CRUNCH displays the output file to which 
the data has been sent, e.g., ”U LISTING". The data set and 
variable names are at the top of the output file. The test 
statistic T, the degrees of freedom, and the matrix of 
observed cell counts are listed. 

C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 

^ • Des cri ption of Comp utatio n of l est Stati stic 
The program Crunch will: 
a. Find the observed cell count 

Specify the r rows and c columns, thus creating 
r*c mutually exclusive cells. Count the number of pairs 
which fall into the categorization of each cell; that number 
is the observed cell count, Oij, of cell ij. 
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b. Find the expected cell count 

All the cell counts in row i are summed to find 
the row total Ri. All the cell counrs in column j are 
summed to find the cclumn total Ci. The expected cell count 
Eij is found by multiplying Ri times Cj and dividing by the 
total number of pairs, N. 

c. Find the test statistic T 

For each cell, subtract Oij from Eij. Square 
each difference. Divide each squared difference by the 
corresponding Eij. Sum those quotients to find the test 
statistic T, which can be compared to a chi-square distribu- 
tion with (r-1)*(c-1) degrees of freedom. The quantile of 
the chi-square distribution corresponding to T indicates the 
empirical confidence coefficient, or the complement of the 
empirical level of significance. 

2 • Progra m Operatio ns 

a. Computation of the cell counts 

If the user supplied the cell counts, go to step 
b. If the user gave fixed row and column totals, rank all 
the first elements to find the row partition values, and 
rank all the second elements to find the column partition 
values. Compare the first element of each pair to rhe row 
partitions: assign to each pair the index of the largest 

partition value which the first element is less than; if the 
first element is larger than the largest partition value, it 
is assigned the integer r. Do the same with the second 
elements and the column partition values. Each pair now has 
two integers assigned, which designate membership in a row 
and in a column. Count the number of pairs falling in each 
cell; this is the observed cell count Oij. 

b. Compute the zest statiszic T 

Find the row and column totals by summing Oij’s. 
Compute the expected cell counts Eij by Ri*Cj/N. Square the 
difference (Oij-Hij) , divide by Eij, sura over ail cells. 
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• Progra m Char acteri s tic s 

a. Limitations 

Maximum number of data points; 400, if the 
program computes the cell counts; unlimited if the user 

supplies cell counts. Maximum number of variables/ 
treatments: 2. Maximum total cells (r*c) ; 50. 

b. Subroutines/Procedures Used 

(1) Lochinvar; Cel^test, Rc_cont ; or Which_test, 
Rc_cont, Set_cell ,Mhich_vec, Exclud_b 

(2) Crunch: Cell er , Partv, rc3 or Tc4, Rcctr , Rcont 

(3) External: Shsort 



XII. CHI-5QUARS GO ODN ESS OF FIT TEST 



A. OVERVIEW 

• G enera l P a rp ose 

With a sample of 15 or more data elements, test if 
that sample could be drawn from a specified distribution. 

2 • Level of Mea sure men t 

Interval 

3 . A ssump tions 

The variable contains a random sample. 

4 . Hypoth esis 

Where F* (x) is hypothesised distribution function, 
and F (X) is the distribution of the population. 

a. Null hypothesis 

HO; F (X) =F*(x) for all X 

b. Alternative hypothesis 

HI; F(x)^F*(x) for at least one x 

B. ACTION REQUIRED EY USER OF PROGRAM 

1 . User De cis io ns about Prog ram Op tions 

The user must decide among five distributions for 
which to test. Parameters may be specified by the user, or 
may be estimated by the program (using method of moments) . 

2- Progra m Ac t ivat ion when Using Da ta Values 

a. Initial Choices 

See section III.C for initial choice details. 

In CMS, type LOCHI. If data is already in a data 
file, enter "Y” to first question, then enter file name; 
whan offered a test, type 8. If data needs to be entered, 

enter "N” for first question, "V" for second; enter data. 
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and when offered a choice of rests, type 8. Enter 
"U" , ''V*', . . . ”Z'' for desired our put file. 

b. Variable and Parameter Choice 

When prompted by Lochinvar; 

(1) Enter indices of variables to be used in 
tesr. Enter blocks or pairs to be excluded, if any. 

. (2) Enter theoretical distribution . Choose 

among: uniform, normal, exponential, erlang and wiebull. 

(3) Enter the parameters of the theoretical 
distribution, or request that the program estimate the 
parameters. In the normal and exponential the method of 
moments can be invoked to allow the program to estimate the 
maximum likelihood estimates of the parameters. In the case 
of the erlang, the user must specify the integer shape 

parameter n, but can allow the program to estimate the scale 
parameter lambda. In the case of the uniform and the 
weibull, the user must specify the parameters 
3* Te s t Execution 

To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CHS. 

O utput A vail able to ;yie User 

The program CRUNCH displays the output file to which 
the data has been sent, e.g., "0 LISTING". The names of the 

data set and variables used are at the top of the output 

file. The test statistic T and the degrees of freedom will 
be displayed. Finally, the matrix of cell counts, expected 
and observed, will be given. 

C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 

^ • Descri ption of Calc ulatio n of T est Stat istic 
a. Calculation of the Expected Cell Counts 

Let k equal the number of data elements. If 

15<k<=50, divide k by 5 and round to the nearest integer. 



63 



This will be C, the number of cells; e.g., k=17 implies C=3, 
k=18 implies C=4. In all cells except perhaps the first and 
last the expected cell count will be 5. If C*5=k, then the 
expected cell counts for all cells is 5. If C*5=k+1, rhe 
expected cell count of the first call is 4; if C*5=k+2, the 
expected cell count of the first and last cells is 4. If 
C*5=k-1, the expected cell count of the first cell is 6; if 
C*5=k-2, the expected cell count of the first and last cells 
is 6. For example, if k=17, C=3; since 3*5=17-2, the first 
and last cells have expected counts of 6, i.e., the expected 
cell counts are 6,5,6. 

If k>50, there will be 10 cells. Divide k by 10 
and truncate; that will be the minimum expected cell count 
E. The remainder of the truncation is rem=k-10*E; add 1 to 
the cell count of the last rem cells. 

The expected cell counts are stored in the 
vector EJ. This is the building block of the F=P (X<x) 
values of the theoretical distribution. Scan the EJ vector 
to find the CUM vector, i.e., CUM ( 1 ) =EJ ( 1) , 

COM (2) =EJ (1) +EJ (2) , ..., CUH(j)=sumof EJ (i) , i = 1,..,j. 

Divide each element of the COM vector by the number of 
elements, k, to form the COMP vector. 

b. The Partition Values 

The usual form of theoretical distributions takes 
a value x and returns a value P, such that F = P (X<x) . In 
this case, the inverse form of the theoretical distributions 
is used to take an F, contained in the COMP vector, and 
return an x. The first (C- 1) values returned form the 
partition values which will divide those data elements into 
cells. 

c. The Observed Call Count 

The data elements are compared to the partition 
values contained in the vector PARTV. An element falls in 
the ith cell if it is larger than PARTY (i-1) but less than 
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or equal to PARTV(i). Elements lass than PARTV(l) are in 
the first cell; elements larger than PARTY (C-1) are in the 
last cell. The observed cell counts are contained in ~he 
vector RCCT. 

d. The Test Statistic 

For each cell, subtract EJ (i) from RCCT(i) , 
square that difference, and divide that square by EJ(i). 
Sum over all cells. This is the test statistic T, with 
degrees of freedom (C-1) . 

2. P rogra m Method of 0 Deration 

The subroutine Gdfit calls the subroutine Muvar to 
find the sample mean and variance if parameters must be 
estimated. The subroutine Cpvgen figure the EJ, CUM, and 
CUMP vectors. The vector PARTY is figured in one of five 
subroutines, depending on the theoretical distribution spec- 
ified; unid, normd, expd, erld, weid. The observed cell 
counts RCCT are found in Celcnt. The test statistic T is 
computed in the subroutine Gdfit. 

3 • Proqra m Cha racte ris tics 
a. Limitations 

Maximum number of data points: 400. Maximum 

number of variable: 1. 
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A. 



OVERVIEW 



XIII. WILC OXO W SIGNED RANK TEST 



1 . Genera l P q irp ose 

With two paired variables (X,Y), the signed rank 
test determines whether the expected values of populations X 
and Y could be the same. 

This requires interval data. If the data is only 
ordinal, use *-.he sign test. 

Level of Measurement 

Interval 
3. A ssump tions 

The pairs (Xi,Yi) are random samples from the popu- 
lation of all pairs. 

• H ypoth eses 

a. Two tailed test 

(1) Null hypothesis hO: E(X)=E(Y) 

(2) Alternative hypothesis hi: E(X)5tS(Y) 

b. One tailed test 

(1) Null hypothesis hD:E(X)<E(Y) 

(2) Alternative hypothesis hi; B (X) >E (Y) 

c. One tailed test 

(1) Null hypothesis h3; E(X)>E(Y) 

(2) Alternative hypothesis hi; E (X) <E (Y) 

B. ACTION REQOIRED EY USER OF PROGRAM 

1 , O^r De cis io n s about Progra m O p tio ns 

To make the test more flexible, a constant term can 
be added to the higher indexed variable, so that the test 
becomes a question of the relationship of E(X) and E(Y)+b. 
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By default, the value of b is 0; but the user can specify 
any real number. 

Lochinvar also asks for the hypothesis type and 
level of significance. 

2. P rogra m Act i vation when D at a Va lues 

a. Initial Choices 

See section III.C for a detailed explanation of 
the following. 

In CHS, type LOCHI. If data is already in a data 
file, enter "Y” to first question, then enter file name; 
when offered a test, type 9. If data needs to be entered, 
enter "N" for first question, ''V” for second; when data 
entered, choose to do a test; whan offered a choice of 
tests, type 9. Enter " U”, "V", . . . "Z” for desired output 
file. 

b. Variable and Parameter Choice 

When prompted by Lochinvar: 

(1) Enter indices of variables to be used in 
test. Enter blocks or pairs to be excluded, if any. 

(2) Enter whether a constanr rerm is to be added 
to the value of the higher indexed variable, Y. By default, 
the constant is 0. 

(3) Enter the letter of the type of hypothesis 
and the hypothesised level of significance. 

• Test Ex ecu tion 

To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 

5 • Output Avail able to ^e User 

The program CRUNCH displays the output file to which 
the data has been sent, e.g., "0 LISTING". The names of the 
data set and variables used are at the top of the output 
file. 

Crunch will provide the sum of negative ranks and 
sum of positive ranks, as well as the test statistic T in 
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all cases. If there are fewer thaa 23 pairs, table look up 
is required; otherwise, the program will compute the empir- 
ical level of significance. 

when the number of untied pairs is less than 20, the 
output will also contain the vector Dif, the differences 
Yi-Xi, in ranked order. 

C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 

1 . Descr iption of Calculation of Test Stat istic/Le vel 
of S igni f icanc e 

a. Sums of the Negative and of the Positive Ranks 

Two sums are calculated, one for the negative 

differences — those pairs in which Xi>Yi — and one for the 
positive differences. Pairs in which Xi=Yi are not consid- 
ered. Subtract Xi from Yi , to form the vector Dif. Take 
the absolute value of Dif to form Adif. Rank the values of 
Adif tc form the vector Rank. If Yi<Xi, the ith value of 
Rank is added to the sum of negative ranks, then that ith 
value is multiplied by negative 1. If Yj>Xj, the jth value 
of Rank is added to the sum of positive ranks. 

b. Computation of Test Statistic 

The vector Rank was originally the ranks of the 
absolute differences; now, the ranks for the negative 
differences have been multiplied by -1. Sum the vector Rank 
to find the numerator of the test statistic. Sum the 
squares of the elements of Rank to find the denominator of 
the test statistic. Divide numerator by denominator. 

c. Level of Significance 

If the number of untied pairs is 20 or less, use 
table A-13 'ref 1], using the sum of positive ranks. 
Otherwise, Crunch computes an approximation, using the test 
statistic T and the standard normal distribution. 
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P rogra m Met hod of 0 Derat ion 

The subroutine Wilcox finds the Dif and Adif 
vectors# and calls the subroutine Rangue (which in turn 
calls Rng) to find the vector Rank. Wilcox computes the 
sums of the positive and negative ranks# as well as the test 
statistic T. If the number of untied pairs is 20 or less# 
it will display the sums of ranks. Otherwise# it will call 
the subroutine Normv to find the approximate empirical level 
of significance, and display whether to reject the null 
hypothesis. 

3 • P rogra m Cha racte ristics 

a. Limitations 

Maximum number of data points; 400. Maximum 
number of variables: 2; maximum number of pairs; 200. 

b. Subroutines/Proc edures Used 

(1) Lochinvar: Wilcoxon 

(2) Crunch: Wilcox# Rangue# Rng#Normv 

(3) External; Shsort 
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XIV. NONPftRAMETRIC CORRELATION 



A. OVERVIEW 

1 . G enera 1 Purp ose 

With n pairs of data elements, X and Y, test whether 
the value of X is statistically independent of the value of 
Y, or if it varies directly or inversely with Y. 

Three different test statistics are available by 
running this one program: Pearson's rho, Spearman's rho, and 
Kendall's tau. 

2* Level of. Measurement 

Interval for the Pearson's rho to have meaning. 

Ordinal for Spearman's rho and Kendall's tau. 

3 . A ssump ti on s 

X,Y are a random sample of population pairs. 

4 . H ypoth esis 

Where rho is a measure of oor relation 1 <rho<= 1 

a. Null hypothesis 

HO: rho(X,Y) = 0, a necessary condition for 

statistical independence 

b. Alternative hypothesis 

H1: rho (X,Y) * 0 

B. ACTION REQUIRED BY USER OF PROGRAM 

• Ose r De cis io ns about Progra m Op rions 

The user chooses which pair of variables from a data 

sat are used in a test, and whether to exclude any of the 

pairs from the test. As described below, the computation of 
Kendall's tau uses an algorithm of complexity level 

n-squared (when comparing every pair with every other pair); 
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if there are more than 40 pairs Lochinvar offers the user 
the chance to skip the Kendall's tau test. If so, only 
Spearman's rho and Pearson's rho (which may have no meaning 
if the data is ordinal) will be computed. Otherwise, all 
three will be produced. 

2 • P rogra m Act i v ation when Using P ara Valu es 

a. Initial Choices 

See section III.C for a detailed explanation of 
the following. 

In CMS, type LOCHI. If data is already in a data 
file, enter "Y" to first question, then enter file name; 
when offered a test, type 10. If data needs to be entered, 
enter ”N” for first question, "7" for second; when data 
entered, choose to do a test; when offered a choice of 
tests, type 10. Enter " U", "V , . . . "Z” for desired output 
file. 

b. Variable and Parameter Choice 

When prompted by Lochinvar: 

(1) If there are more than two variables in the 
data set, enter the indices of the two to be used. Enter 
pairs to be excluded, if any. 

(2) If there are more than forty pairs, enter 
whether the Kendall's tau should be computed. 

^ • Test S x ecu ticn 

To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 

5* O utput Ava il able to the User 

The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING". The names of the 
data set and variables used are at the top of the output 
file. Spearman's rhc, Pearson's rho, and Kendall's tau are 
listed. 
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C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 

"I- Descri pt io n of Calc ulatio n of Te st Stat is ri cs 

a. Pearson's Rho 

Pearson's rho is SXY divided by square root of 
SXX times SYY. The deviations of X are found by subtracting 
the mean of X from each value of X; deviations of Y are Yi 
minus mean of Y. The product of deviation of Xi times devi- 
ation of Yi, summed over all pairs, is SXY. The sum of 
squared deviations of Xi is SXX; the sum of squared devia- 
tions of Yi is SYY. 

b . Spear man ' s Rho 

Rank the values of X. Rank the values of Y. 
Those pairs of ranks (R(Xi) ,R(Yi)), are then used to compute 
SXY divided by square root of SXX times SYY. That is, 

Spearman's rho is Pearson's rho performed on the ranks of 
the values rather than the values themselves. 

c. Kendall's Tau 

Tau is roughly the measure of the number of 
concordant pairs minus disc oncordant pairs divided by total 
number of possible pairing. Concordance implies positive 
correlation; e.g., if pair of the data set number 4 (X4,Y4) 
has an X value greater than, say, X5 , and a Y value greater 
than Y5, then this is one small indication that there is 
positive correlation between X and Y. Therefore, the 4th 
and 5th pairs of the data set form to make one concordant 
pair. If X6 is greater than X4, but Y6 is less than X4, 

this is a little evidence of negative correlation; the 4th 
and 6th pairs of the set form to make one disconcordant 
pair. Ties in either X or Y mean that the comparison yields 
neither concordance nor disconcordance. 

To compute tau, order the pairs (Xi,Yi) in 
ascending order of X's. Compare the first pair (the one with 
the lowest X value) to every other pair. If XI is tied with 
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Xi or Y1 with Yi, go to the next pair; if not, (Y1 < Yi) 
implies concordance, (Yl > Yi) implies disconcor dance; 
increment the relevant count. After the first pair is 
compared with all others, the second pair is compared with 
all succeeding pairs (3,4,...,k), incrementing counts as 
necessary. Each pair is compared to all succeeding pairs, 
until Xk-1 is compared to xk. Kendall's tau equals the 
number of concordant pairs minus the number of disco ncordant 
pairs, all divided by the total possible number of pairings 
(k(k-1)/2). If all pairings are concordant, tau will equal 
1; all disconcor dant pairings will yield an -1. 

2* Progra m Meth od of 0 Deration 



a. 


Pearson's and Spearman's Rhos 
The subroutine Rhoer calculates 


SXY, SXX, SYY 


for 


any input 


va lues , 


and returns--inter 


al ia--rhc. 


For 


Pearson ' s 


rho use 


the values of the 


data set. 


For 


Spearman ' s 


rho, use 


subroutine Rng to rank 


the values of 


the 



data set, and send these ranks to Rhoer. 

t. Kendall's Tau 

Since the values have already been ranked, use 
the indices of ranked X to order the pairs (R(Xi) ,R(Yi)) . 
Ranks work the same as values for finding concordance or 
disconcordance , but ties are easier to discern with ranks. 
With nested do for loops, compare the ith pair (i = 1 to 
k-1) with pairs j = i+1 to k; R(Xi) = R(Xj) or R(Yi) = R(Yj) 
means to continue with looping, otherwise, if R(Yi) < R(Yj) 
then increment concordant counter, if 5(Yi) > R(Yj) then 

increment disconcordant counter. At the end of the nested 
loops, divide the difference of the concordant counter minus 
the disconcordant counter by the term (k(k-1)/2), 

3 . P roqra m Cha r acts ris tic s 

a. Limitations 

Maximum number of data points; 400. Maximum 
number of pairs; 200. 

b. Subroutines/Procedures Used 
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(1) Lochinvar: CorrelS 

(2) Crunch; Corr el, Rnq, Sho er 

(3) External: Shsort 
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XV. NONPARA MET RIC RES RES SION 



A. OVERVIEW 

1 . Gen era 1 Purpose 

Regression tests the effect of the value of indepen- 
dent variable X has on the value of the independent variable 
Y. The linear regression model assumes that the effect of X 
on Y is linear, that is, E ( Y | X) =E (Y) b1 (X-E (X) ) . This test: 

(1) Finds estimates of b1. 

(2) Predicts values of Y given values of X 

(3) Test if the linear model is correct 

(4) Finds confidence limits for the value of b1 
2* Level of Measurement 

Ordinal, to test the linear model 

Interval, for the other purposes listed above 

3. A ssump tions 

a. Raidom sampling of pairs. X may be specified, as 
long as the Y portion of the pair, given X, is independent. 

b. The linear model is the correct model. 

4 . H ypoth esis 

Where b1 is slope the parameter; b1* hypothesized. 

a. Null hypothesis 
HO: b1=b1* 

b. Alternative hypothesis 

HI: blitbl* 



B. ACTION REQOIRED EY OSER OF PROGRAM 

'' • Oe cis io ns about P rogra m O pti ons 

a. The user chooses the t’-fo variables to be used, 
and designates which one is dependent. The program will 
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automatically fit the least squares model, finding b1. The 
intercept parameter bO, E (Y | X) =bO + b 1 *X, is also computed. 

b. The user can input values for X to predict values 
of Y based on the least squares parameters. 

c. The user can specify a hypothesised b1* to zest 
whether a linear model is appropriate. The default value is 
b1* = 0; this is the classical test of whether an indepen- 
dent variable predicts a dependent variable. 

d. The user can request confidence intervals on the 
slope parameter b1. As described below, the algorithm to 
compute , those intervals is of degree of complexity n 
squared, that is, for n pairs(x,y), n(n-1)/2 comparisons 
must be made and stored to find the confidence intervals. 
For larger values of n, zhe user should consider not 
requesting this option. 

If the user does choose to have the confidence 
intervals computed, the value W will be requeszed by 
Lochinvar. That value is explained by Conover [Ref. 1: p. 
267], and contained in table A-12 of that reference. 

2 • P rogra m Act ivation whe n n§is.l D ata Values 

a. Initial Choices 

See section III.C for a detailed explanation of 
the following. 

In CMS, type LOCHI. If data is already in a data 
file, enter "Y” to first question, then enter file name; 
when offered a test, type 11. If data needs zo be entered, 
enter "N” for first question, '*V' for second; when data 
entered, choose to do a test; when offered a choice of 
tests, type 11. Enter ” U", "V” ,. . . "Z” for desired output 
file. 

b. Variable and Parameter Choice 
When prompted by Lochinvar: 

(1) Enter indices of variables to be used in 
tesz. Enter pairs to be excluded, if any. 
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(2) Enter values cf X to compare E(Y|X) based on 

least squares linear fit. 

(3) Enter hypothesized slope parameter b1*. 

(4) Enter whether confidence limits are desired. 

3* E x ecut i cn 

To perform the test, after the above entry of data 
and parameters, type CSUNCH while in CMS. 

^ • O utput A va il able to the User 

The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING”. The names of the 
data set and variables used are at rhe top of the output 
file. The least squares fit parameters will be listed, and 
the Spearman’s rho statistic for the specified value of b1 . 
If the user requested expected values of Y for some values 
of X, the values of X and Y will be listed. Finally, if 
confidence bounds were requested, those will be given. 

C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 

1 . Descr i pti on of Cal culat ion of Test Statis tic / Leve l 
of Significance 

a. Find the Regression Parameters b1,b0 

This step is identical to parametric simple 
regression. Find the means of X and Y, and the vectors of 
deviations (Xi-mean of X), (Yi-mean of Y) . Multiply the 
corresponding deviations of X and Y together, and sum the 
products to form SXY. Square the deviarions of X, then sum 
those squares, to form SXX. Sum the squared deviations of Y 
to find SYY. The slope parameter b1 is SXY/SXX. The inter- 
cept parameter bO is the mean of Y minus b1 times the mean 
of X. The expecred value of Y given X is bO plus bl times 
X. Residuals are the difference between the expected and 
actual value of Y. 
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b. Test the Value of X as a Predictor of Y 

In parametric regression, it is standard to test 
whether the slope parameter b1 could be 0 in the entire 
population. In this package, any value b1* can be set as 
the null hypothesis. The test checks to see if the resi- 
duals of the fit using b1* are correlated with the values of 
X; if not, then the fit is considered adequate. The resi- 
duals are (Yi-b1*Xi), the vector of residuals and the inde- 
pendent variable X are compaired using the Spearman’s rho 
test. See section XIV for details of that test. 

c. Predictions of Y based on Least Squares Fit 

With the least squares parameters bO and b1 
already computed. Crunch multiplies each given X times b1, 
adds bO, and displays the predicted Y. 

d. Confidence Bounds for the Slope Parameter b1 

Form the vector of slopes S. S(1) is (X(1)-X(2)) 

divided by (Y(1)-Y(2)); S(2) is (X(l) -X (3) ) /(Y(1) -Y (3) ) , and 
so on. with k pairs (X, Y) , there are N=k(k-1)/2 elements of 
the slope vector, when comparing each i with each other j 
once . 

Sort the vector S. The indices of the confidence 
limits are r (0.5*(N-W)) and s (0.5* (N+W) +1) , where N is the 
total number of slopes and W is the user entered value from 
table A-12\of Conover. If necessary, round r or s up to 
integers. The lower limit on b1 is S (r) , the upper limit is 
S(s) . 

2. Proora m Method of 0 perat i on 

The subroutine Regret calls the subroutine Rhoer to 
find the least squares parameters b1 and bO. Regret figures 
the residuals for the specified b1*, and sends the residuals 
to subroutine Correl to have Spearman’s rho calculated. 
Regret figures the vector of slopes S; that vector is sorted 
by Shsort; Regret finds indices s and r and the confidence 
limits zo b1. 
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3 • P iogra m Cha r acteristics 

a. Limitations 

Maximum number of data points: 
number of variables: 2. 

b. Subroutines/Procedures Used 

( 1 ) Lochinvar: Regret 

( 2 ) Crunch; Regret, Rhoer, Correl 

( 3 ) External: Shsort 



V 



400 . 
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XVI. MQNOTONIC regression 



A. OVERVIEW 

1 . G enera l Purpose 

With independent variable X, and dependent variable 
Y, find the E(Y|X) equatinn, based nn a least squares fit of 
the ranks of X and Y. Linear interpolation finds values from 
X to E(rank of X) to E(rank of Y) to E(Y). 

2. Level of Me asure ment 

Interval 

3. A ssump tions 

a. The sample is a random sample 

b. The regression of Y on X is monotonic. 

4 . Hypoth esis 

This test does not produce a test statistic; the 
assumption is that the monotonic model is correct. Instead, 
it will predict expected values of X for given values of X, 
Also, it can produce a trace of the monotone regression, 
which 'is ai^alogous to the straight line of the least squares 
fit of linear regression. 

B. ACTION REQUIRED BY USER OF PROGRAM 

1. User De cis i o ns about P rog ram O ptions 

The user must specify the index for the independent 
value X, and dependent variable Y. The user can include 
values of X to have predicted a corresponding value of Y. 
The user can request a trace of the monotone regression. 

2 . Progra m Act i vati on 

a. Initial Choices 

See section III.C for initial choice details. 
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In CMS, type LOCHI. If data is already in a data 
file, enter "Y" to first question, then enter file name; 
when offered a test, type 12. If data needs to be entered, 
enter "N" for first question, "7” for second; when data 
entered, choose to do a test; when offered a choice of 
tests, type 12. Enter " U", "V" , . . . "Z" for desired output 
file . 

b. Variable and Parameter Choice 

When prompted by Lochinvar: 

(1) Enter indices of variables X and Y. Enter 
pairs to be excluded, if any. 

(2) Enter whether to predict values of Y for 
specified values of X. If yes, enter values of X. 

(3) Enter whether a trace is desired. 

3 • Test Bx ecu t ion 

To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 

U . Out put A vai lable to the User 

The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING". The names of the 
data set and variables used are at the top of the output 
file. In all cases, there will be listed the parameters bO 

and b1, the intercept and slope of the least squares fit on 

the ranks of X and Y. If the user specified X's to predict 

Y's, the predicted values will be listed. Finally, if a 

trace was requested, the pairs of X's and Y's will be given. 

C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 

1 . Descri pt ion of Calc ulati ons 

a. Least Squares Fit on Ranks of X and Y 

Sort and rank X and Y. With n pairs, X (i) and 
Y(j) are the order statistics, RX (i) and RY(j> are the 
ranks, i=1,...,n, h=1,...,n. Compute the least squares fit 
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to find bO and b1, such that E (RY | RX) =b0 +b 1 *RX. See section 
XV for details of least square computations. 

b. Predicaticn of Y Based on Values of X 

Linear regression will extrapolate values of Y 
for a given value of X if that X is outside the range of the 
observed values X(l), X(n). Monotone regression will 

not extrapolate; if the user specifies an X outside the 
range of the observations, that X will not yield an expected 
Y. 

Let Xp be one of the values the user specified to 
predict a Y, Yp. Find the pair of values of the order 
statistics of X which bracket Xp, X (i) <Xp<=X (i+ 1 ) . The 
expected rank of Xp, RXp, must fall between those order 
statistics* ranks, R X (i) <RX p<=RX (i*- 1 ) . Use linear interpo- 
lation to find RXp: RXp= RX(i) (Xp-X (i) )/ (X (i+ 1) -X(i) ) 

♦ (RX (i+1) -RX (i) ) . 

Use the parameters bO and b1 to find RYp, the 
rank of Yp: RYp=bO+b1*RXp. 

Find th“ ranks of Y which bracket RYp, 
RY ( j ) <RYp<=RY ( j + 1) . The value of Yp lies between Y(j) and 
Y(j+1). Use linear interpolation to find Yp: Yp= Y(j) + 

(RYp-RY(j) ) /(RY (j+1) -RY(j) ) * ( Y ( j + 1 ) - Y ( j ) ) . 

c. Trace of Monotone Regression 

The trace is a set of lines, berween a sequence 
of points (Xi,Yi), which describe the monotone regression. 
When plotted, the trace can be used to predict a value of Y 
for an X. This is similar to using the straight line of 
linear regression, Y=bO + b1*X, for prediction. 

To find the endpoints of the trace, use X(1) and 
X(n) , and the procedure of part b, to find S(Y|X(1)) and 
5 (Y| X (n) ) . 

Intermediate points of the trace will use the 
values of Y to predict expected X. Bach value of Y has a 
rank, RY, which is used with bO and b 1 to find RXt (trace). 
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RXt= (RY-bO)/b1. For each RXt, fiad the pair of RX's 
bracket it. Ose linear interpolation to find Xt. 

Pioara m Mat hod of 0 perat ion 

The subroutine Monore calls Rnq to rank and sort 
values. The subroutine Correl finds the least squares 
parameters bO and b1 from the rank vectors RX and RY. The 
subroutine ESPY performs estimations of Y for given X's, 
calling Rtov (which calls Interp) to do the linear interpo- 
lations. The subroutine Trace performs the trace, calling 
Vtor, Rtov and Interp. 

3 . P rogra m Char act er i s tic s 



a. Limitations 

Maximum number of data points: 400. Maximum 

number of variables; 2. 

b. Subroutines/Procedures Used 

(1) Lochinvar: Monoreg 

(2) Crunch: Monore, Rtov, Vtor, Esty, Interp, Rnq 

(3) External: Shsort 
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XXVir. MEDIA N TEST 



A. OVERVIEW 

1 . Genera l Purp ose 

With two or more samples, this test will determine 
if all the populations from whioh the samples were drawn 
could have the same median. 

As described below, the package allows combining of 
data set variables into composite variables in order to test 
population groups for common median. 

The median may be a surrogate for expected value. 

2. Level of Measurement 

Ordinal 

3 . A ssump tions 

The variable were randomly irawn from the respective 
populations, and are mutually independent. 

4 . Hypothes is 

With k populations, X (1) ,X (2) ,. . . , X(k) , and the 
median denoted as M, 

a. Null hypothesis 

HO: M (XI) =M(X2) = . ..=M (Xk) 

b. Alternative hypothesis 

HI: M(Xi)^M(Xj) for at least one of the pairs 

B. ACTION REQOIRED BY OSER OF PROGRAM 

• Dse r De cis io ns about P rogra m Options 

The user chooses which variables of the data set to 
include in the test. 

Aggregate variables may be formed by having Crunch 
combine the expected cell counts of several variables into 
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ona or more aggregate variables. For example, let dara set 
Farmland contain six variable, in order, A through F. 
Variable A is corn yield per acre of farm land treated with 
fertilizer and pesticide. B is corn yield with fertilizer 
but no pesticide. C is corn yield with pesticide but no 
fertilizer. D is corn yield per acre from land untreated by 
either fertilizer or pesticide. Variables E and F have 
other values. Variables A and C could be combined into 
aggregate variable 1, and B and D into 2, to test whether 
the median yield with pesticide is agual to the median yield 
without pesticide. Similarly, A and B, and C and D, could 
be combined into aggregate variables 1 and 2, to test the 
median of yields with and without fertilizer. 

Proper planning and labeling of the variables will 
facility this aggregation. 

2. Pr ogra m Act i vat ion when Osing Da ta Valu es 

a. Initial Choices 

See section III .C for a detailed explanation of 
the following. 

In CHS, type LOCHI. If data is already in a data 
file, enter "Y” to first question, then enter file name; 

when offered a test, type 13. If data needs to be entered, 

enter "N" for first question, "V” for second; when data 

entered, choose to do a test; when offered a choice of 
tests, type 13. Enter " U" , "V”, . . . '*Z" for desired output 
file. 

b. Variable and Parameter Choice 

When prompted by Lochinvar; 

(1) Enter indices of variables to be used in 

test. Enter blocks or pairs to be excluded, if any. Enter 

whether to use aggregate variables. 

(1) If aggregate variables are not used, enter 
indices of variables to be used in the test. 
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(2) If aggregates are used, 
prompt for a choice on each data set variable, i=1 to one 
number of variables in the data set. If the ioh da~a set 
variable is to be excluded from the test, enter 0. If it is 
to be included, enter the index of the aggregate variable 
into which it should be combined. For example, with the 
Farmland data set described above, Lochinvar would prompt 
user choices on data set variable i= 1 to 6. Let the test 
be a comparison of yields with or without pesticide, thus 
variables A and C into aggregate variable 1, B and D into 2, 
and E and F excluded. When prompted for the data set var 1, 
enter 1. For data set var 2, enter 2. For data set var 3, 
enter 1. For data set var 4, enter 2. For data set var 5 
and 6, enter 0. 

4. Test Execution 

To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 

5 • Output Avail able to the User 

The program CRUNCH displays the output file to which 
the data has been sent, e.g., ”U LISTING”. The names of the 
data set and variables used are at the top of the output 
file. The program will print the test statistic and the 
degrees of freedom, as well as a two by number of variables 
contingency table, showing how many elements of each vari- 
able are less than or equal to the grand median of all vari- 
ables, and how many are greater than. 

C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 

^ • Descri ption of Calc ulatio n of Test Statis tic 

a. Calculation of the Cell Counts of the M Matrix 
The M matrix has two rows and a column for each 
variable in the test. The A row has the number of elements 
in each variable less than or equal to the grand median; the 
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B row has the number of elements greater than rhe grand 
median. The values of all the variables are copied into a 
vector, which is sorted; the median of all those values is 
the grand median. The counts of the cells of the matrix M 
are set to 0. Each element of the first variable is compared 
to the grand median; if less than or equal, the k row first 
column cell count is incremented, otherwise increment B row 
first column cell count. After all elements of the first 
variable are compared to the grand mean, do the second vari- 
able, incrementing the 2 column's cell count, et cetera, 
b. Computing the Test Statistic 

Sum all counts of the A row to find Atot, and all 
counts of B to find Bnon. Sum Atot and Brot to find N. Sum 
the first column counts to find NAV ( 1) , the second column 
counts to find NAV(2) , et cetera. The expected cell count 
for cell aj is Atot*NAV ( j) /N ; for bj, Btot*SAV ( j) /II. The 
test is in essence the chi-square rest for independence (r*c 
contingency table), with the tesn statistic as the sum over 
each cell of observed minus expected count squared, divided 
by the cell expected count. The actual equations used 
exploit the structure of the problem. The degrees of 
freedom axe (r-1)(c-1) = number of variables minus one. 

2* Program Method of 3 Deration 
a. Aggregation of Variable 

The subroutine Media assumes that aggregate vari- 
ables are always used. iJhan not, the program uses k aggre- 
gate variables where k equals the number of data set 
variables included in the test; aggregate variable 1 is data 
sat variable 1 , . .. , aggregat e variable k is data set variable 
k. When aggregates are in fact specified, their values, 
contained in r? vector, will be used. In either case, the 
vector AGVAEI will contain the indices relating each data 
set variable to an aggregate variable. 
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b. Cell counts 

The matrix M is a fiction. The vector NA is the 
count of elements less than or equal to the grand median; 
it is arranged according to aggregate variable indices. SB 

is the count of elements greater than the grand median. In 

a do fcr loop, i=1 to the number of data set variables, k is 
set to AGVARI(i) , and each element of the ith data set vari- 
able is compared to the grand median. If the element is 
lass than or equal to the grand median, increment NA (k) 

(=NA ( AGVAHI (i) ) , otherwise, increment NB(k). 

c. Compute the Test Statistic 

NA(i) and NB(i) are summed to find NAV (i) . All 
elements of NA are summed to find A, all elements of NB are 
summed to find B. A plus 3 equals S. The test statistic is 
computed using the specialized aquations of Conover [ref 1, 
p. 172]. The degrees of freedom are the number of aggregate 
variables minus one. 

3 • P roqra m Cha r acte ris tic s 

a. Limitations 

Maximum number of data points: 400. Maximum 

number of variables or aggregate variables: 15. 

b. Subroutines/Procedures Used 

(1) Lochinvar: Median, Ag_med, Mhich_vec , None_ex 

(2) Crunch: Media 

(3) External: Shsort 
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XVIII. KROSKAL WALLIS TSST 



A. OVERVIEW 

1 • Genera l Purp ose 

with a data set of more than two variables, 
X1,X2,. . . ,Xk, test whether the expeoted values of all vari- 
ables are equal. 

If this test's null hypothesis is rejected, then it 
is possible to determine which of the pairs of variables 
have an unequal mean. 

If the data ser is blocked, consider instead of this 
test the Cochran (nominal) , Frieiman (ordinal) or Quade 
(interval) tests. If paired, consider the Sign (ordinal) or 
wilcoxcn (interval) tests. 

2- Level of Mea s ure me n t 

Ordinal 

3 . A ssu mpti on s 

a. Each variable is a random sample from within its 
popu laticn. 

b. The sampling was mutually independent among the 
populations. 

c. The population distributions are identical, or 
else some of the populations tend to have higher values than 
others. 

4 . Hvp oth es is 

a. Null hypothesis 

HO: E (XI) =E(X2) = . . . E(Xk) 

b. Alternative hypothesis 

HI: E(Xi)#E(Xj) for at least one pair of popula- 

tions represented by the data set. 
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B. ACTION REQUIRED EY USER OF PROGRAM 

1 . User De cis io ns a bout P rog ran Optio ns 

The user must specify which variables of a data set 
are included in the test. 

After the test is run. Crunch will present the 
overall test statistic, and a series of comparisons between 
the variables. If, and only if, the null hypothesis has 
been rejected by the overall test statistic may the user 
infer any information by the comparisons between the pairs 
(Xi,Xj) of variables. 

2 • P rogra m Acti vation when Usin ^ Data Valu es 

a. Initial Choices 

See section III .c for a detailed explanation of 
the following. 

In CMS, type LOCHI. If data is already in a data 
file, ent.er "Y” to first question, then enter file name; 
when offered a test, type 14. If data needs to be entered, 
enter "N" for first question, '•V" for second; when data 
entered, choose to do a test; when offered a choice of 
tests, type 14. Enter '* U" , "V” , . . . "Z" for desired output 
file. 

b. Variable and Parameter Choice 

When prompted by Lochinvar, enter indices of 
variables to be used in test. 

3 • Test B x ecu t i on 

To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 

^ • O utput A vail able to th e Us er 

The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING". The names of the 
data set and variables used are at the top of the output 
file. The overall test statistic will be given. Also 
listed will be comparisons of pairs of variables; these 
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comparisons only have meaning if the null hypothesis has 
been rejected. The paired comparison are essentially a t 
test run on the ranks of the two variables being compared, 
testing the hypothesis that their mean ranks are the same. 
Crunch supplies the absolute difference in their mean ranks 
and the pooled standard deviation; the user must find the 
appropriate value of t to conduct the t test. 



C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 



1 . Calc u lat ion of Test Statistic for the Null 



Hyp ot hesis 

a. Find the 
R an k al 1 
in the ith variable 
elements in the ith 
the ith variable is 
ables is N. 



Sums of Variable Ranks Ri 
values in the data set. Sum the ranks 
to get the value Ri. The number of data 
variable is ni, so the average rank in 
Ri/ni. The sum of ni over all k vari- 



b. Find the Test Statistic 

The test statistic numerator is the sum over k 
treatments of (Ri**2)/ni, minus ( (N+ 1 ) **2) *N/4. To find the 
test statistic denominator, S2, first sum over all data 
elements the square of the ranks. Subtract from this sum of 
squares the term ( (Ntl) **2 ) *N/4) . Divide that difference 
by (N-1) to get S2 . Divide the numerator by S2 to find the 



test statistic T. 



c. Make Comparisons of the Pairs of Variables 

As mentioned above, this step is the student t 
test on the ranks of two variables to see if their popula- 
tion mean could be equal; it means nothing if the null 
hypothesis has not been rejected. 

For each pair of variables Xi,Xj, find the abso- 
lute difference in mean ranks, j Ri/ni-R j/n j| . Find the 
pooled variance, (S2 * (N- 1-T ) / (N-k) ) times (1/ni+1/nj). The 



pooled standard deviation is the square root of the pooled 
variance. If the absolute difference is greater than the 
pooled standard deviation times rhe desired value of t, 
reject the assertion that the mean of populations i and j 
are the same. 

2. Fro gra m Method of 0 per at ion 

The subroutine Kruwal calls Ranque to rank the 
values of the data set. In a nested do for loop, i=1 to k, 
j=1 to ni, rhe intermediate terms of the statitics are 
computed. Kruwal prints the overall test statistic to the 
specified output file. It then makes pairwise listings of 
the absolute difference of mean ranks and the corresponding 
pooled standard deviations. 

3. P roqra m Cha racteris tic s 

a. Limitations 

Maximum number of data points: 400. Maximum 

number of variables/treatma nts: 15. 

b. Subroutines/Procedures Used 

(1) Lochinvar: Kruwal 

(2) Crunch: Kruwal, Ranque 

(3) External: Shsort 
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XIX. SgOARED RANK TEST FOR E^tJAL V^IANCE 

A. OVERVIEW 

• G enera l Purp ose 

with two or more treatments in a data set, test 
whether the population variances of those treatments are 
equal . 

2. Level of Measurement 
Interval 

3. A ssump ti ons ; 

Randoa sampling within each treatment population, 
and among treatment populations. Level of measurement is 
interval. 

^ • H ypoth es e s 

a. Two tailed test, where X,I are treatments 

(1) Null hypothesis hO: va r (X) = var (Y) 

(2) Alternative hypothesis hi: var (X) ^^var { Y) 

b. One tailed test, two treatments 

(1) Null hypothesis hO: var (X) >=var (Y) 

(2) Alternative hypothesis hi: var ( X) <var ( Y) 

c. One tailed test, two treatments 

(1) Null hypo'-hesis hO: v ar (X) =<var (Y) 

(2) Alternative hypothesis hi: var (X) >var (Y) 

d. Number of treatments (Xi) are three or more 

(1) Null hypothesis hO: var (X 1) =var (X2) = 

. . . = var (Xk) 

(2) Alternative hypothesis hi: var (X i) #var (X j) , 

for some i, i, indices of the data set 
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B. ACTION REQUIRED BY USER OF PROGRAM 

1 • User De cis io ns abo ut P rogra m Oori ons 

The program allows the usar to choose which treat- 
ments to include in the test. 

2 . Pr oqra m Activ ation whe n Usin £ Da t a Valu es 

a. Initial Choices 

See secrion III .C for a detailed explanation of 
the following. 

In CMS, type LOCHI. If data is already in a data 
file, enter "I” to first question, then enrer file name; 
when offered a test, type 15. If data needs to be entered, 
enter '•N** for first question, "V'» for second; when data 
entered, choose to do a test; when offered a choice of 
tests, type 15. Enter " U” , "V" ,. . . "Z" for desired output 
file. 

b. Variable and Parameter Choice 

When prompted by Lochinvar, Enter indices of 
variables to be used in test. 

Test Ex ecu tion 

Tc perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 

5. Output Avai lable to the User 

The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING”. The names of the 
data set and variables appear on the top of the output file. 
Also included are the test statistic and the number of 
treatments used to compute the statistic. 
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C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 

1 • D escri pt io n of Calc ula tio n of T es t Statistic (T 1/T2 ) 

a. Find the ranks of each absolute deviation: 

Compute the sample mean for each of k treatmenrs. 

Subrracr the treatment mean from each of the treatment's 
elements. Take absolute values of each of those differ- 
ences. Rank all N absolute differences (N=total elements in 
data set) . 

b. Complete intermediate statistics: 

(1) SUMR, the sum of all ranks. 

(2) SDMR2, the sum of the square of all ranks. 

<3) S0MR4, the sum of all the fourth power of 

each rank, 

(4) SOMRJ, the k element vector with the sum of 
the ranks for each treatment, 

(5) SUMR2J, the k element vector with the sum of 
squared ranks for each treatment. 

(6) R2MU, the average squared rank: SUMR2/N. 

(7) RMO, the average rank: SUMR/N. 

c. For two treatments compute the statistic T1 

(1) Numerator: ( S0MR2J (1) -n1 *R2MU) 

(2) Denomination: SUMR4*n1 *n2/(N*(N-1) ) , minus 

(RMU**2) *n1*n2/ (N-1 ) 

(3) T 1=nu marator /denominat or 

d. For three or mors treatments compute T2 

(1) Numerator: difference between these two 

terms, sum over k treatments of (SUMR2J**2) /ni, minus 
(R2MO**2) *N 

(2) Denominator: (SUMR4-N* (R2MU**2) ) / (N-1) 

(3) T2=numerato r/dsnominator 
2 . P rogra m Ope r ations 

The program uses subroutine MUVAR to find treatment 
means. The subroutine Squark finds the absolute deviations; 
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while rhe subroutine Rnq ranks them. The subroutine Squark 
Uses nest "do for" loops, i= 1 to k (treatments) and j=1 zo 
ni (elements in treatment) , to caluculate the vscrcrs SUMRJ 
and S0flR2J, and the sums SUMR, SUMR2, SUMR4. RMU and R2MU 
are found by dividing SUMR and SUMR2 by N. Depending on the 
number of treatments, T1 or T2 is found by simple arithmetic 
using the intermediate statistics. 

3« Progra m Char acteri s tic s 

a. Limitations 

Maximum number of data elements is 400. Maximum 
number of treatments is 15. 

b. Subroutines/Procedures Used 

(1) Lochinvar: Squarank, which^vec 

(2) Crunch: Squark, Muvar, Rnq 
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XX. hartley test for E2IJAL VARIMCE 



A. OVERVIEW 

1 • G enera l Par p ose 

Test for equal variance amoaq the k underlying popu- 
lations of the variables or treatients, reflected by the 
values for those treatments in the data set. 

2. Level of Mea sureme nt 

Interval 

2 . Ass sum £t ion s 

Random sampling within each treatment, independence 
of sampling among treatments. Interval level of measurement 

4 . H ypoth esis 

a. Null hypothesis 

HO: var (XI) =var (X2) =. . . =var (Xk) , where k= number 

of treatments 

b. Alternative hypothesis 

HI: var (Xi) ^var ( Xj) for at least one pair 

B. ACTION REQUIRED BY USER OF PR03RAM 

1 . Use r De cis io ns ab 2 U t Program Options 

The program allows the user to choose which treat- 
ments to include in the test. 

2. Progra m Activation when Using Data Values 

a. Initial Choices 

See section III.C for initial choice details. 

In CMS, type LOCHI. If data is already in a data 
file, enter "Y" to first question, then enter file name; 
when offered a test, type 16. If data needs to be entered, 
enter *'N” for first question, *'V* for second; when data 
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entered, choose to do a test; when offered a choice of 
tests, type 16. Enter '• U" , '»V'» , . . . "Z” for desired output 
file. 

b. Variable and Parameter Choice 

When prompted by Lochiavar, Enter indices of 
variables to be used in test. 

^ • T es t 2 xe cut ion 

To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CHS. 

5 . o utput A vaila ble to the User 

The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING'*. The names of the 
data set and variables used are at the top of the output 
file. The there are the maximum and minimum variances and 
indices of the associated treatments. Finally, there is the 
F statistic, to be compared to a table prepared for the 
Hartley test. 

C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 

^ • D escri pt ion of Calc ulatio n of Test Stat istic 

Crunch does the following; Compute the sample vari- 
ance for each treatment. Find the smallest and largest 
variance. Divide the largest variance by the smallest vari- 
ance to yield the test statistic F. Compare the test 
statistic to a special table to see if the null hypothesis 
should be rejected. 

2. Progra m Method of 0 perat ion 

The treatment variances are figured in the subrou- 
tine MUVAR. The subroutine HARTLE scans the variances to 
find the largest and the smallest, divides the former by the 
latter, and gives that number as the test statistic F. 
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3 • P rogra m Char acteri s tic s 

a. Limitations 

The maximum number of data elements is 400; 
maximum number of treatments is 15. 

b. subr outine/Procedures Involved 

(1) Lochinvar; Hartley, Mhich_vec 

(2) Crunch: Hartle,Muvar 

(3) External: Shsort 



the 
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XXI. FRIEDM AN TEST AND DURBIN TEST 



A. OVERVIEW 

• G enera l Pu rp ose 

To perform a treatment and block test, similar to 
ANOVA, using only ordinal data. In fact, the data may be 
rankable only within each block; i.e., no relationship 
between data values in different blacks is necessary. 

The Friedman test is for data sets cf complete 
block design, i.e., each treatment is applied to each block. 

The Durbin test is a generalized Friedman -^est, for 
use with incomplete block design. with t treatments and b 
blocks, each treatment is applied to r blocks (r<b) , and 
each block receives k treatments (k<t). The user must guar- 
antee that the design is balanced, and insert a filler value 
in place of missing data values when entering the data set. 

Each test will generate a set of differences 
between pairs of treatment summed ranks. , If, and only if, 
the null hypothesis is rejected, these differences may be 
used, in conjunction with a standard deviation value and the 
student's t distribution, to determine if there is a signif- 
icant different between two treatments. 

2. Level of Wea sur emen t 

Ordinal within each block. 

3. A ssump tions 

Values within blocks are independent among blocks. 

4. Hypoth esis 

a. Null hypothesis 

HO: Each possible ranking within the block is 

equally likely (that is, the treatment effects are equal) 

b. Alternative hypothesis 
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HI: At least one of the treatments rends to get 

higher ranks than at least one other treatment. 

B. ACTION REQUIRED EY USER OF PROGRAM 

1 . Use r Decisions about Progra m Poti o ns 

The user must decide which treatments and blocks to 
include in the test. If a incomplete block design is to be 

used, the user must specifically enter the data sen for this 

purpose: there will be t variables, and b elements entered 

for each variable. In those elements which have a no data 
value (a treatment which was not applied to some block) , 
enter a filler value. In a data set, there can be only one 

filler value, and it must be different than any of the data 

values. For example, if all the data values are positive, 
-1 could be used as a filler. 

2* Progra m' Acti vation when Using Da^ Va lues 

a. Initial Choices 

See section III.C for a detailed explanation of 
the following. 

In CHS, type LOCHI. If data is already in a data 
file, enter "Y” to first question, then enter file name; 
when offered a test, type 17. if data needs to be entered, 
enter ”N" for first question, ”7" for second; when data 
entered, choose to do a test; when offered a choice of 
tests, type 17. Enter ’• U", "7" , . . . "Z" for desired output 
file. 

b. Yariable and Parameter Choice 
When prompted by Lochinvar: 



(1) 


Enter whether to use 


the 


Friedman 


or Durbin 


(2) 


If the Friedman test 


is 


cho sen , 


enter the 



treatments to be used and any blocks to be excluded. 
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(2) If the Durbin test is chosen, enter the 
treatments to be used, any blocks to be excluded, k= number 
of treatments applied to each block, r= number of blocks to 
which each treatment is applied, and the filler value. 

^ • Tes t Execut ion 

To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 

5. Out put Available to the User 

The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING”. The names of the 
data set and variables used are at the top of the output 
file. The overall tesr staristic for the null hypothesis 
will be given. In addition, the difference between pairs of 
treatments' summed ranks will be lisred; these are mean- 
ingful only if the null hypothesis is rejected. 

C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 

1 . Descri ption of Calc ulati on of Te st St at is ti c 

a. Use of Filler, If Durbin Test 

If the Durbin rest is indicated, find the maximum 
value in the data set. Add 1 to that maximum value to form 
MPI. In a do for loop, substitute MPl for the filler value, 
and create the integer ’'vect or INTEST. INTEST (i) equals 1 
if the ith element in the data set is not the filler value, 
0 if the ith element is a filler. 

b. Rank Values Block by Block 

In both tests, find the ranks of the values of 
the data set block by block. Load those ranks into the R 
vector. 

In the Durbin test, the filler elements will have 
a higher rank than those of data elements, since the fillers 
were loaded with MP1. Therefore, after all the ranking is 
complete, multiply the rank vector times INTEST; this sets 
the ranks of all filler elements to 0. 
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c. Suns of Treatment Ranks and Total Sums 

Square each rank of the rank vector R, and sum 
those squares over all treatments and blocks to form A2. Sum 
the ranks of each treatment to form the vector RJ; RJ (j) 
contains the sum of ranks of the jth treatment. 

d. Test Statistic for the Friedman Test 

Square each element of the RJ matrix; sum those 
squares; divide that sum of squares by the number of blocks 
to find B2. The denominator of the test sraristic is A2-B2. 
Let b be the number of blocks, t the number of treatments. 
The numerator of the test statistic is (b-1) * (B2 -0.25bk* 

(k+1)*»2). Divide the numerator by the denominator to get 
the test statistic T2. Compare T2 to the F distribution 
with (k-1) and (k-1) (b-1) degrees of freedom 

e. Test Statistic for the Durbin Test 

Sum the squares of RJ. Multiply that sum of 
squares time 12(t-1). Divide that product by rt(k-l) (k+1), 
where r is the number of blocks to which each treatment is 
applied, and k is the number of treatments applied to each 
block. This quotient is the first term of the test 
statistic. The second term is 3r(t-l) (k+1) divided by 
(k-1) . Subtract the second term from the first term to find 
the test statistic T. T is compared to the chi-square 
distribution with t- 1 degrees of freedom. 

f. Paired Comparisons of Treatments 

Crunch will print the following numbers in all 
cases, but they only have meaning when the null hypothesis 
is rejected. For each i,j of the treatments, the absolute 
difference between the sums of treatment ranks is listed. 
The number that is equivalent to the standard deviation is 
also computed. For absolute differences greater than the 
standard deviation times the t quantile (1-alpha/2), the 
treatments can be considered to have a different effect. 
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• Proqra m Mathcd of 0 perat ioa 

The subroutine Friedm perforins most of the computa- 
tions. The subroutine Ranque ranks the values block by 
block. The subroutine Parcom does the comparisons of treat- 
ment sums of ranks. 

3 • Pr oqra m Cha racteris tic s 

a. Limitations 

Maximum number of data points; 400. Maximum 
number of variables/treatments: 15. 

b. Subroutines/Procedures Used 

(1) Lochinvar: Friedman 

(2) Crunch; Friedm, Ranque, Rnq, Parcom 

(3) External; Shsort 
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XXII. OU ADE TEST 



A. OVERVIEW 



1 . G enera l Purp ose 

With interval data organized into t treatments and b 
blocks, test whether all treatments have the same effect. 
If not, treatments can be compared for differences in pairs. 

2. Level of Mea sur ement 
Interval 



3 . A ssump tions 

a. The values of each blocks are independent of the 
values of the other blocks. 

b. The sampling within each block can yield a range 
of the elements; that range must be rankable among blocks 

4 . Hypoth es is 

a. Null hypothesis 

HO: Each possible ranking within the block is 

equally likely (that is, the treatment effects are equal) 

b. Alternative hypothesis 

HI: At least one cf the treatments tends to get 

higher ranks than at least one other treatment. 



B. ACTION RE3UIRED BY USER OF PROGRAM 

• User Decisions about PP 03 ram Op t io n s 

The user must specify the treatments to be included 
in the test, and whether any blocks are to be excluded. 
Crunch will provide the absolute difference between pairs of 
treatments; if and only if the null hypothesis is rejected, 
these comparisons can be used to test whether the effects of 
the pairs are equal. 
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2 . P roqra m Activ ation whe n Using P ara Vala es 

a. Initial Choices 

See section III.C for a detailed explanation of 
the following. 

In CMS, type LOCHI. If data is already in a data 
file, enter "Y" to first question, then enter file name; 
when offered a test, type 18. If data needs to be entered, 
enter "N" for first question, "7" for second; enter the 
data; when offered a choice of tests, type 18. Enter 
mjii , (I V»» , . . . «z'* for desired output fils. 

b. Variable and Parameter Choice 

When prompted by Lochinvar: Enter indices of 

variables to be used in test. 

3 • Test Ex ecu tion 

To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 

4 . O utput A vailable to the User 

The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING'*. The names of the 
data set and variables used are at the top of the output 
file. The test statistic T and its degrees of freedom are 
listed. Finally, the absolute difference between the pairs 
of sums of treatment ranks are given. These differences, 
when used with a standard deviation computed by Crunch and 
the t distribution, can be used to check for equal effect 
between pairs of treatments. 

C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 

• D escri ption of Calcu latio n of Test St ati st ic 
a. Rankings within and among Blocks 

Rank each element in the block; assign the 

elements to the matrix of ranks R. Subtract the largest 
element from the smallest element; that becomes the block's 
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range, stored in the vector R. after all blocks have had 
their elements ranked, rank the vector R; those ranks form 
the vector Q. 

b. The Size Matrix S 

with t equal to the number of treatments, the 
average rank of R is (t+1)/2. Subtract that average rank 
from- each element of R. Multiply the reduced ranks of the 
first block by Q(1), the reduced ranks of the second block 
by Q (2) f 6t cetera, tc form the matrix S. 

c. Sums of the Treatment Elements of S 

Sum the elements of the jth rrearment of S over 
all blocks to form the jth element of the vector SJ. Square 
each element of SJ, sum those squares over all treatment, 
and divide by b, the number of blocks, to find B1. Square 
all elements of S, and sum all those squares to find A1 . 

d. Test Statistic 

The test statistic T1 is (b-1) time B1 divided by 
the quantity A1 minus B1. Compare T1 to an F statistic with 
degrees of freedom (t-1) and (b-1)(t-1), 

e. Pairwise Comparisons 

These values have meaning only if the null 
hypothesis is rejected. Compute the absolute difference 
between each pair of the vector SJ. If that difference is 
greater than the student's t quantile times the computed 
standard deviation, the pair of treatments can be said to 
have a different effect. The standard deviation is found by 
taking the square root of (2b(A1-B1)/(b-1) (t-1)) . 

2* Progra m Met hod of 0 perat ion 

The subroutine Quade calls Ranque to find the matrix 
R and the vector Q. The test statistic is computed in 
Quade, as is the standard deviation. The subroutine Parcom 
makes the pairwise comparison. 
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nu m b e r 



Proqra m Cha r acteris tic s 

a. Limitations 

Maximum number of data points 
of variables/trea tme nxs; 15. 

b. Subroutines/Procedures Usad 

(1) Lochinvar: Quade 

(2) Crunch: Quade, Ranque, Rnq, 

( 3 ) External: Sbsort 



400. 



Parcom 



Maximum 
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XXIII. COCHRO T^T FOR RELATED OBSERVATIONS 

A. OVERVIEW 

1 . Genera l Purpose 

with a data set of b blocks, to which k treatments 
have been applied, test whether the treatments have been 
equally effective. "Effective" must be reducible to two 
categories, mutually exclusive and collectively exhuastive. 
Here, these categories will called success and failure. 

If the data is not reducible in this fashion, the 
Friedman (ordinal) or Quads (interval) tests are available. 

2. Level of Mea sur ement 
Nomina 1 

3 . A ssump tions 

The blocks are a random sample from among all blocks 
'4 . Hvpoth es is 

a. Null hypothesis 

HO: The treatments are equally effective 

b. Alternative hypothesis 

H1; There is a difference in effectiveness among 

the groups. 

B. ACTION REQUIRED BY USER OF PROGRAM 

"I • Pser De cis i ons about P rog ram Options 

The user must decide the standard for success or 
failure of each treatment. If the data is nominal, the user 
should enter cell counts. See paragraph 2, below. If the 
data is ordinal, the usee may enter a partition value for 
each treatment of a data set; see paragraph 3. If this 
latter method is used. Crunch will classify each value of -an 



109 



entered data set as a success or failure by its relationship 
to the partition value for its treatment. 

If partition values are used in conjunction with a 
data set, the user may specify which variables of the data 
set are to be included in the test, and may designate blocks 
to be excluded. 

2« Progra m acti vat ion when Osing Cell Counts 

a. Initiation and test selection 

In CMS, type LOCHI. Enter "N” for no to question 
about previously entered data. Enter ''C'* for cell counts. 
Enter . . ."Z” for desired output file. When offered a 

choice of tests, enter 19. 

b. Cell count entry 

Lochinvar will ask how many treatments and how 
many blocks are included in the test. After those numbers 
are entered, Lochinvar will ask whether block 1, treatment 1 
was a success. Enter yes or no. The program will prompt 
for each treatment in block 1, then move to block 2, and so 
forth, through the b*k choices. The matrix of 1's 
(successes) and O's (failures) will be displayed, and the 
user has the chance to correct the values for any block (the 
program will prompt for entries of each treatment of a spec- 
ified block) . 

3* Piogra m Activ ation when Osing D ata Values 
a. Initial Choices 

See section III.C for a detailed explanation of 
the following. 

In CMS, type LOCHI. If data is already in a data 
file, enter "Y" to first question, then enter file name; 
when offered a test, type 19. If data needs to be entered, 
enter '*N'* for first question, '’V* for second; when data 
entered, choose to do a test; when offered a choice of 
tests, type 19. Enter '* 0” , "V'*,. . . "Z" for desired ourput 
file . 
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b. Variable and Parameter Choice 

Lochinvar will ask if treatment 1 is in the 
test. If yes, then it will then ask the partition value for 
treatment 1, whether successes lie above or below that 
value. The same questions will be asked for each treatment. 
After each treatment is either excluded or given a partition 
value, Lochinvar will offer the user the chance to exclude 
blocks from the test. 

^ • TJSi Ex ecu tion 

To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 

5 . O utput Avail able to the User 

The program CRUNCH displays the output fils to which 
the data has been sent, e.g., ”U LISTING'*. The names of the 
data set and variables used are at the top of rhe output 
file. The test statistic I will be listed. 

C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 

1 • D escri P t ig n of Calcula tio n of T est Stati stic 

a. Calculation of Block and Treatment Totals 

If the user chooses the merhod of cell counts 
(paragraph 2, above) , Lochinvar will sum the successes over 
each of b blocks and over each of k treatments. These 
totals will be in the cells vector of line 3a of the options 
file (see Appendix D for details of rhe options file) , to be 
read by Crunch. 

If the user chooses to enter partition values 
(paragraph 3 above) , Crunch uses that information to compute 
the block and treatment totals. 

b. Computing the Test Statistic 

The numerator of the test statistic involves the 
sums of successes for each treatment. The total number of 
successes N is found by summing each treatment number of 



successes Ci; N divided by the number of treatments k is the 
average number of successes per treatment. Find the sum of 
squared deviations by summing over k treatments the square 
of (Ci-N/k) . Multiply that sum of squares by k(k-1) to get 
the test statistic numerator. 

The denominator of the test statistic is the sum 
over all b blocks of the number of successes in block i 
times the number of failures in block i. 

Divide the numerator by the denominator. 

2« P rogra m Method of 0 peration 

a. when Using Cell Counts 

In the procedure Cochran, Lochinvar prompts the 
answer of yes or no to the question of whether block i, 
treatment j, is a success. If yes, it increments the total 
of successes in the count for block i and treatment j. The 
total for block i is contained in the ith element of the 
vector CELLS; the total for the jth treatment is contained 
in the (j+b)th element of CELLS, where b is the number of 
blocks. The number of blocks (NROW) and number of treat- 
ments (NCCL) are written in Line 3 of the options file; the 
vector CELLS is written in Line 3a. The Crunch subroutine 
Celler reads those values, and sends them to the subroutine 
Cochrn, which computes the test statistic. 

b. When using Partition Values 

In the procedure Cochran, Lochinvar asks the 
partition value of each treatment included in the test; 
those values are entered in the odd numbered elements of 
vector PART_V, that is, 1 ,3,. . . , (2k- 1) , where k is the 
number of treatments. In the even numbered elements of 
PART_V is a +1.0 if successes are below the preceding parti- 
tion value, or -1.0 if the successes are above it. For 
example, PART_V of 3.0 -1.-0 7.0 1.0 means that there are two 
treatments; in the first treatment, successes are above 3.0, 
in the second, successes are below 7.0. The number of 



112 



blocks (NROW) and the number of treatments (NCOL) are 
written in the Line 3 of the options file; PART_V is written 
into line 3a. 

The Crunch subroutine Caller calls the subroutine 
Coch to use PARTLY and the data elements values to find the 
totals of successes for the blocks and treatments. It 
places those counts (in the same format as CELLS, describe 
in subparagraph a, above) into the vector RCCT. That vector 
is used by the subroutine Cochrn to compute the test 
statistic. 

3 • P roqra m Cha ra cteris tic s 

a. Limitations 

When the user enters cell counrs, the sum of the 
treatments and blocks cannot exceed 50 (b+k) , and the total 
number of cells (b*k) cannot exceed 400. 

When the program computes the treatment and block 
totals, the maximum number of data points is 400; the 

maximum number of treatments is 15. The sum of treatments 
and blocks (b+k) cannot exceed 50. 

b. Subroutines/Procedures Osed 

(1) Lochinvar; Cochran, Cel_test 

(2) Crunch; Coch rn ,Celler 

(3) External; Shsort 
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XXIV. KOLMOGOROV TEST 



A. OVERVIEW 

1 . G enera l Purpose 

With a sample of 30 or less data points, test to see 
whether the population from which the sample is drawn is of 
a specified distribution with specified parameters. The 
choice of hypothesised distributions is uniform, normal, 
exponential, weibull, and erlang (gamma with an integer 
shape parameter) . 

The Lilliefors test is identical to the Kolmogorov 
test except that it estimates parameters from the sample; 
normal and exponential distributions can be hypothesised. 

2. Level of Meas ureme n t 

Interval 

3 . A ssump ti ons 

The sample is a random sample from the population. 

4 . Hypoth es es 

With F* (X) as the theoretical distribution with 
specified parameters, and S (x) as the population’s urder- 
lying distribution function, the hypotheses are: 

a. Two tailed test 

(1) Null hypothesis hO: F*(x)=S(x) for all x 

(2) Alternative hypothesis hi: F*(x)^S(x) for 

at least one x in range of sample 

b. One tailed test 

(1) Null hypothesis hO; F*(x)<S(x) for all x 

(2) Alternative hypothesis hi: F*(x)>s(x) 

c. One tailed test 

(1) Null hypothesis hO: F*(x)>S(x) 

(2) Alternative hypothesis hi: P*(x)<S(x) 
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B. ACTION REQUIRED BY USER OF PROGRAM 



1 • User De cis io ns abou t P rog ram Options 

If a data set has more than one variable, the user 
must specify which is used. If the chosen variable has more 
than 30 elements, the chi-square goodness of fit test will 
automatically be given. Otherwise, the user must specify 
the distribution to use, and all its parameters, from among 
the fcllcwing: 

a. Normal (specify mu, sigma squared) 

b. Exponential (specify lambda=rate , scale) 

c. Uniform (specify a=lower bound, b=upper bound) 

d. Erlang (specify n=integer, shape; lambda=scale) 

e. Weibull (specify alpha=shape, lambda=scale) 

2- Pr ogra m Act i vation when Usi ng D ata Values 

a. Initial Choices 

See section III.C for a detailed explanation of 
the fcllcwing. 

In CMS, type LOCHI. If data is already in a data 
file, enter "Y” to first question, then enter file name; 
when offered a test, type 20. If data needs to be entered, 
enter "N" for first question, "V for second; when data 
entered, choose to do a test; when offered a choice of 
tests, type 20. Enter '* U” , "V" , . . . "Z” for desired output 
file. 

b. Variable and Parameter Choice 

When prompted by Lochinvar; 

(1) Enter index of variable to be used in test. 

(2) Enter number of theoretical distribution. 

(3) Enter parameters as prompted. 

(4) Enter letter of type hypothesis. 
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3 • Test Ex ecu tion 

To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 

4. Output Avail able to the User 

The program CRUNCH displays the output file to which 
the data has been sen*, e.g., "U LISTING'*. The names of the 
data sez and variables used are at the top of the ou-put 
file. The values of the largest difference F*(x)-S(x), the 
smallest (most negative) difference, and the greatest abso- 
lute difference are displayed. Depending on the hyporhesis. 
Crunch selects the proper value of these three as the test 
statistic, and displays that value. 

C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 

1 . D escri Pt ion of Calc ulatio n of Test Statistic 

a. Empirical Cumulative Distribution Function (CDF) 
Sort the data elements, giving order statistics 

X(1) ,...,X(n), where n is the number of data elements. The 

empirical cdf equals 0 for values of X lower than the 

smallest X, X(1) . At X(1) , the cdf jumps to 1/n; the cdf 
increases by 1/n at each new order statistic. At X (n) , the 
empirical cdf equals 1, and remains 1 for all values of x 

greater than the sample. The empirical cdf is stored in the 

vector S. 

b. Points on the Theoretical CDF 

With a specified distribution and parameters, 
find the value of the cdf for each value of the order 
statistics X (1 ),..., X (n) . For example, with exponential 
(lambda=2) specified and X=1.5, the value of the theoretical 
cdf is 1 . 0-exp (- (2* 1 . 5) ) =0.9502. Store these values in the 
vector F. 
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CDF* s 



c. 



Differsncs between Empirical and Theoretical 



with a sample of size n, there are 2n differences 
between S and F. For example, at X(1), S jumps from 0 to 
1/n, so that F(1) is compared to both values. At each order 
statistic value, the theoretical distribution F (i) is 
compared to S(i-1) and S (i) . Crunch keeps track of the tmin 
(most negative difference, or if all are positive, the 
smallest positive) , and tmax (most positive, or if all are 
negative, the closest to zero), 
d. Test Statistics 

For a two tailed test (hypothesis A), the test 
statistic is the larger of the absolute values of tmax and 
tmin. Fcr hypothesis 3, tmax is the test statistic. For 
hypothesis C, tmin is the test statistic. 

P roqra m Met hod of 0 perat i on 

The subroutine Kolmog compute the empirical cdf, by 
a do for loop i=1 to n, with S(i)=i/n. Shsort orders the 
sample values to form order statistics. The theoretical 
cdf’s are found in one of the following subroutines: normf, 

expf, unif, erlf, and weibf. 

The subroutine Findt finds the differences between S 
and F for each value of X. For i=1, F is compared to 0 and 
S(1) . Then, in a do for loop i=2 to a, F (i) is compared to 
S(i) and S(i-1). These 2n difference are stored in the 
vector T. Another do for loop, j=l to 2n, finds tmax and 

tmin. From those two values, the greatest absolute differ- 
ence is found. 

2 • P roqra m Cha r a cteri s tie s 
a. Limitations 

Maximum number of data points: 400. Maximum 

number of variables/treatments: 15. 
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b. Subroutines/Procedures Used 





(1) 


Lochinvar 


; Kolfflog, Qpar, Wpar, 


Erpar 


Expar 












(2) 


Crunch; 


Kolmog, Findt; Normf, 


Unif 


Erlf , 


Heibf 










(3) 


External: 


Shsort 





Jlp ar , 
Expf , 
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XXV. LILLIE? ORS TEST 



A. OVERVIEW 

1 • Genera l Purp ose 

With a sample of less than 31 elements, this tests 
whether the sample came from a specified distribution, 
either a normal or exponential. The parameters used for the 
test are the maximum likelihood estimates (MLE) , derived 
from the sample mean and variance. Those two values are 
computed and displayed in the output file; however, 
Lilliefors tests only the distribution type, not the param- 
eter value. (Any value other than the MLE would have a 
higher probability of rejecting the null hypothesis). 

If specific parameters must be tested for, use the 
Kolmogorov test. 

2* Level of Measurement 

Interval 
3 . Assump tio n s 

The data values are a random sample of the popula- 
tion . 

U. H ypoth esis 

With P*(x) as the hypothesised distribution func- 
tion, with MLE parameters, and S(x) as the actual population 
distribution 

a. Null hypothesis 

HO: F*(x)=S(x) for all x 

b. Alternative hypothesis 

HI: F*(x)#S(x) for at least one x 
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B. ACTION REQUIRED EY USER OF PROGRAM 

1 De cis io ns about P rogra m Options 

If more than one variable is in the data set, the 
user must specify which one to urilize for the test. If 
that variable has more than 31 elements, the goodness of fit 
test will be offered automatically. Otherwise, choose 
whether to test for the normal or the exponential distribu- 
tion . 

2* Pro qra m Act i vation whe n Using Values 

a. Initial Choices 

See section III.C for a detailed explanation of 
the following. 

In CMS, type LOCHI. If data is already in a data 
file, enter "Y” to first question, then enter file name; 
when offered a test, type 21. If dara needs to be entered, 
enter ”N" for first question, ”V” for second; when data 

entered, choose to do a test; when offered a choice of 
tests, type 21. Enter •• U'',”V»',. . . »'Z" for desired output 
file. 

b. Variable and Parameter Choice 

When prompted by Lochinvar: 

(1) Enter index of variable to be used in test, 
if there is more than one in the data set. 

(2) Enter whether to test against the best 
fitting normal or exponential distribution. 

T est Execution 

Tc perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 

4 . Output Avail able to the User 

The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING’*. The names of rhe 
data set and variables used are at the pop of the output 
file. The sample mean and variance are listed, as well as 
the test statistic. 
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C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 

1 . D escri pt ion of Calc ulatio n of Test St at ist ic 

a. Finding the MLE Parameters 

The sample mean and variance are computed. If 
the test is for normality, the theoretical distribuition* s 
parameters mu and sigma squared are estimated by the sample 
mean and variance respectively . If the exponential is the 

theoretical distribution, its parameter lambda is the recip- 
rocal of the sample mean. 

b. Calculation of the Test Statistic 

The remainder of the test is exactly the same as 
the two- tailed Kolmogorov test. The empirical distribution 
function is generated using (1/n) increments for each of the 
n sample data elements. The difference between the empir- 
ical and theoretical distribution is computed at each value 
of X contained in the sample. The greatest absolute differ- 
ence is the test statistic. 

2 . P rogra m Ms t hod of 0 per at ion 

The subroutine Lilief calls Muvar to find the sample 
mean and variance, then assigns MLE values to the rp vector. 
The subroutines Norf cr Expf, as appropriate, are called to 
generate values of the theoretical distribution at the 
values of X (1 ) , .. . X (n) . The subroutine Findt finds the 
largest, smallest and greatest absolute difference; that 
last value is the test statistic. 

3 . P rogra m Ch a racteri s tic s 

a. Limitations 

Maximum number of data points: 30, in one vari- 

able. 

b. Subroutines/Procedures Used 

(1) Lochinvar: Lillie 

(2) Crunch: Lilief, Muvar, Expf, Norf , Findt 

■(3) External: Shsort 
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XXVI. SHAPI5D WILK TES T FOR NOR MALITY 

A. OVERVIEW 

^ • Genera l P urp ose 

With a sample of 30 or less data points, test to see 
if the sample is from a normal distribution of unspecified 
parameters. Although Crunch computes the sample mean and 
variance, they are not displayed. No statement about the 
population parameters is made by this test. 

2. Level of Me a sure men t 
Interval 

3 . A ssump tions 

The sample is randomly drawn from the population. 

4 . H ypoth esis 

with F*(x) as a normal distribution of unspecified 
parameters, and S(x) as the distribution function of the 
population, 

a. Null hypothesis 

HO; P*(x) =S(x) for all x 

b. Alternative hypothesis 

HI; F*(x) #S(x) for at least one x 

B. ACTION RE3UIRED BY USER OF PROGRAM 

1 . Dser De cis i o ns about Program Options 

If there is more than one variable in the test, the 
user must specify which one is to be utilized. If that 
variable has more than 30 elements, the goodness of fit test 
will be automatically offered. If not, no further choices 
are necessary. 
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2 . P roqra m Act iv ati on when Usin 3 ^ ta Va lu es 

a. Initial Choices 

See section III.C for a derailed explanation of 
the fcllcwing. 

In CMS, type LOCHI. If data is already in a data 
file, enter "Y” to first question, then enter file name; 
when offered a test, type 22. If data needs to be entered, 
enter "N" for first question, "V" for second; when data 

entered, choose to do a test; when offered a choice of 
tests, type 22. Enter •' U”, "V", . . . ’’Z" for desired output 
file. 

b. Variable and Parameter Choice 
When prompted by Lochinvar: 

Enter index of variable to be used in test, if 
there is more than one in the data set. 

^ • Test Execut ion 

To perform the test, after the above entry of data 
and parameters, type CRUNC3 while in CMS. 

5. O utput Avai lable to the User 

The program CRUNCH displays the output file to which 
the data has been sent, e.g., "0 LISIING”. The names of the 
data set and variables used are at the top of the output 
file. The test statistic will also be included. 

C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 

1 . Desc ri ption of Cal culat ion of Test Stati stic /Leve l 
of Si gni fic anc e 

a. Compute the Sum of Squared Deviations 

The sum of squared deviations is the sum over n 
of the square of the difference between X(i) and the mean of 
X. The program actually figures the sample variance and 
multiplies by (n-1) . This is the denominator of the test 
statistic. 
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b. Compute the Sum of the Weighted Differences 

The sample is sorred into the order statisrics, 

X(n). Subtract values with the same depth from 
the end points, e.g, subtract X(n) - X(1), X(n-1) 

X (2) , . . . ,X (n-i+ 1 ) - X(i). The n is even, there are n/2 

differences; if n is odd, there are (n-1) /2 differences, 
with the median excluded. Multiply these differences by 
a(i) weights, provided by Lochinvar. Sum the weighted 
differences. This is the numerator of the test staisric. 

c. Compute the Test Statistic 

Divide the sum of weighted difference by the sum 
of squared deviations. 

2- Prq qra m of 0 peration 

The subroutine finds the sample variance by calling 
Muvar. Shsort is used to order the sample into order 
statistics X(1),...,X(n). A do for loop, i=1 to trunc(n/2), 
is used to calculate the sum of a (i) * (X (n-i+1 ) -x (i) ) ; the 
a(i)’s are contained in the rp. vector. 

3 • Proqra m Char acteri s tic s 

a. Limitations 

Maximum number of data points: 400. Maximum 

number of variables/treatme nts : 15. 

b. Subroutines/Procedures Used 

(1) Lochinvar: Wilkes 

(2) Crunch: Wilksh, Muvar 

(3) External: Shsort 
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XXVII 



SMIRNOV TEST 



A. OVERVIEW 

^ • Genera l Paro ose 

With two samples, X and Z, each drawn from a 
different population, test whether the underlying population 
distributions P(x) and G(y) could be equal over the enrire 
range of X and Y. 

The Smirnov and Cramer-von Mises test differ only in 
the computatien of the test statistic, and therefore, the 
choice of hypotheses, 

2* Level of Mea sure men t 

Ordinal 

3. A ssump tions 

The samples are drawn randomly from their popula- 
tions, and are rankable against each other as well as within 
samples. 

4 , Hypoth eses 

a. Two tailed test 

(1) Null hypothesis h3: F(x)=g(x) for all x 

(2) Alternative hypothesis hi: F(x)^g(x) for at 
least one x in range of sample 

b. One tailed test 

(1) Null hypothesis hO ; F(x)<g(x) for all x 

(2) Alternative hypothesis hi: F(x)>g(x) 

c. One tailed test 

(1) Null hypothesis hO : F(x)>g(x) 

(2) Alternative hypothesis hi: F(x)<g(x) 
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B. ACTION REQUIRED BY USER OF PROGRAM 

1 Di£i§i2i:J P roqr aai Options 

The user must designate which of two variables are 
used, if the data set contains more than two variables. The 
user specifies the type of hypothesis. 

2* P rogra m Act ivation when Using Data Valu es 

a. Initial Choices 

See section III.C for a detailed explanation of 
the following. 

In CMS, type LOCHI. If data is already in a data 
file, enter '•Y” to first question, then enter file name; 
when offered a test, type 23. If data needs to be entered, 
enter "N" for first question, ”V" for second; when data 
entered, choose to do a test; when offered a choice of 

tests, type 23. Enter "U",''V", "Z" for desired output 

file . 

b. Variable and Parameter Choice 

When prompted by Lochinvar: Enter indices of 

variables to be used in test. 

3 • Test Ex ecu ti o n 

To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 

4. Output Ava i lable to the User 

The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING". The names of the 
data set and variables used are at the top of the output 
file. The test statistic will also be listed. 
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c 



ACTIONS PE8F0BMED BY THE PROGRAM CRUNCH 



• Descri ption of Calc ulatio n of T es t Stati stic 

a. Calculate the Difference Vector S 

X and Y are sorted into order statistics, 
X (1 X (m) , and Y (1) , . . . , X (n) . The counter for X is i, 
the counter for Y is j. Compare X(1) with Y(1) . If X(1) < 
Y(1) then increment i; if X (1) > Y(1) then increment j. The 
difference between the empirical distribution functions is 
(i-1)/m minus (j-1) /r. After each comparison of X (i) and 
Y(j) the appropriate counter is incremented and the differ- 
ence is calculated and stored in the vector S. The counter 
i is incremented until it reaches (m+1); j can reach (n+ 1) . 
If X(i) = Y (j) , increment i, calculate and store the differ- 
ence, then increment j, and compute and store the differ- 
ence. There will be (m+n) elements in S. 

b. Calculate the Test Statistic 

For a two tailed test (hypothesis A) , the 

greatest absolute difference is the test statistic. For 

hypothesis B, the largest difference is the test statistic. 
For hypothesis C, the smallest difference is the test 
statistic. 



2* P rogra m Meth od of 0 perat ion 

The subroutine Smirn calls Shsort to rank X, then 
rank Y. It calls xhe subroutine Finds to calculate the 
difference vector S, then chooses the test statistic based 
on the hypothesis specified. 

3. P roqra m Cha ra cteri s tic s 

a. Limitations 

Maximum number of data points: 400. Maximum 

number of variables: 2. 

b. Subroutines/Procedures Used 

(1) Lochinvar: Smirnov 

(2) Crunch: Smirn, Finds 

(3) External: Shsort 



127 



XXVIII. CRA.^ ER-VON MISBS TEST 



A. OVERVIEW 

1 . Genera l Purpose 

with two samples, X and Y, test whether the under- 
lying population distributions F(x) and G(y) could be equal 
over the entire range of X and Y. 

The Smirnov and Cramer-von Mises test differ only in 
the computation of the test statistic, and therefore, the 
choice of hypotheses. 

2 • Level of Me asure men t 

Interval 

.3. A ssump ti on s 

The samples are randomly drawn from the respective 
populations, and are independent of each other. 

4. H ypoth es is 

a. Null hypothesis 

HO: F (X) =G (X) for all x 

b. Alternative hypothesis 

HI: F(x)=G(x) for at least one x 

B. ACTION RE30IRED BY USER OF PROGRAM 

1 . U ser Pe cis io ns about P rog ram Optio ns 

If there are more than two variables in the data 
set, specify which two are in the test. 

2. P ro ora m Ac t i vat ion wh en Using Data Values 

a. Initial Choices 

See section III.C for initial choice details. 

In CMS, type LOCHI. If data is already in a data 
file, enter '*Y** to firs- question, then enter file name; 
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when offered a test, type 28. If data needs to be entered, 
enter ”N" for first question, ••V” for second; when data 
entered, choose to do a test; when offered a choice of 
type 28. Enter " 0” , ”V" , . . . •'Z” for desired output 

b. Variable and Parameter Choice 
When prompted by Lochinvar: 

Enter indices of variables to be used in test., 
if the data set has mere than two. 

3 • Test Ex ecu tion 

To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 

U. O utput Avail able to th^ U se r 

The program CRUNCH displays the output file to which 
the data has been sent, e. g . , ”U LISTING". The names cf the 
data set and variables used are at the top of the output 
file. In addition, the test statistic is listed. 

C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 

1 . D escri ption of Calc ulatio n of Test Statistic 

a. Calculate the Difference Vector S 

This step is exactly the same (using the same 
subroutine) as in the Smirnov test. See section XXVII for 
deta ils. 

b. Calculating the Test Statistic 

Find the sum of the squared elements of the S 
vector. With m as the number of elements of X, and n as the 
number of elements of Y, multiply the sum of squared S (i) 
times (m*n) / (a +n ) **2 . This is the test statistic. 

2 • Proqra m Me t h cd of 0 perat iq n 

Thesubroutine Crmvon calls Shsort to order X and 
order Y into order statistics. The subroutine Finds finds 
the vector S. In a do for loop, i=1 to (m+-n) , Crmvon 



tests , 
file . 
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That sum is then multiplied by 



squares and sums s (i) . 

(m*n) / (m + n) **2 . 

3 • Pr ogra m Char a ct er i s tic s 

a. Limitations 

Maximum number of data points: 400. Maximum 

number of variables: 2. 

b. Subroutines/Procedures Osed 

(1) Lochinvar: Cramvon 

(2) Crunch: Crmvon, Finds 

(3) External: Shsort 
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APPENDIX A 

TUTORIAL; LINKING TO AND USING THE COMPUTER PACKAGE 

A. PURPOSE 

This appendix contains three things: instructions on 

how to link to the class disk to perform nonparametric 
tests; instructions on how to copy the programs onto a 
user's A disk; and an example of how to use the program, 

B. LINKING TO THE CLASS DISK 

To perform the test, a student can link to user number 
2266p, read only password OBRIEN, and use the programs there 
to perform nonparametric tests. The steps are; 

• L in kin g to the Clas s Disk 

in CMS, type "CP LINK 2266P 191 192 RR”. The 

computer will type back ENTER READ ONLY PASSWORD. Enter 
"OBRIEN". After receiving back the "R" message, enter ACC 
192 B/A, 

2 . Usin^ the P rogr ams 

An example of the use of Lochinvar and Crunch is 
given belcw. Detailed instructions for use are given in 
section III and the sections on specific tests. 

3 . Rel eas ing the Class Disk 

After you are dons, type RELEASE 192. 

C. COPYING THE PACKAGE 

Three files are needed to run nonparametric tesrs; 
"LOCHI MODULE", "CRUNCH TEXT", and "CRUNCH EXEC". The first 
two are compilations of the source code, contained in "LOCHI 
PASCAL" and "CRUNCH FORTRAN". To copy any of these 
programs: 



132 



1. Link to the Class Disk 

2. Copy the File: Type XEDIT <filename> <filetype>. Type 

SAVE = = A in the space marked ==> at the bottom of the 
screen, then FILE. The file will be on the A disk. 

3. Release the Class Disk 



D. AN EXAMPLE OF TEST DS E 



To fully understand this example, read or review 
sections III and XXI of this text. The data set used is 
from Conover [Ref. 1: p.319]; the tasr used is the Friedman 
test. The example shows the program’s prompting in capital 
letters, and the user's response in lower case, with a <== 
to the right. 

1 . E nteri ng the Dat a F ile 

a. Preliminary Steps 

There are four initial steps to perform before 
entering a data file. The first is to activiate Lochinvar, 
then to request not to use a previously entered file, then 
to enter the file by data values, and finally to write the 
information to data file "lochi a”. 

lochi <== (1) 

EXECUTION BEGINS 

DO YOU WANT TO PERFORM A TEST ON A PREVIOUSLY ENTERED DATA SET? 

ENTER Y OR N 
n <== (2) 

YOU HAVE A CHOICE OF TWO METHODS TO ENTER DATA: 

1. ENTER EACH VALUE OF DATA SET. VALUES MUST BE NUMERICAL, AND 

MEASUREMENT AT LEAST ORDINAL. ALL TESTS MAY BE PERFORMED THIS WAY. 

2. ENTER ONLY THE COUNT OP SUCCESSES/FAILURES OR ROW/COLUMN CELL 

COUNT. DATA VALUES AND LEVEL OF MEASUREMENT MAY BE NOMINAL, BUT 

COUNTS MUST BE NON-NEGATIVE INTEGERS. THE FOLLOWING TESTS MAY BE 

PERFORMED: BINOMIAL, MCNEMAH, R*C CONTINGENCY, CHI-SQUARE 

GOODNESS OF FIT. 

ENTER V FOR VALUE OR C FOR COUNT 

V <== (3) 

ENTER FILETYPE (A-F) TO WHICH DATA GOES 
a <== (4) 

b. Enter Information of Label of Data Set 

The label needs three types of information: the 

data set name, the number of variables, and rhe variable 
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names. For this example, we are using the data of exercize 
1, page 309 in Conover, with four variables, named the names 
of the seasons: 



ENTER NAME OF DATA SET 

prob 1,p 309, number of births by hospital <== (1) 

ENTER NUMBER OF VARIABLES OR TREATMENTS 
4 <== (2) 



ENTER NAME OF VARIABLE NR 1 

winter <== (Baf 

ENTER NAME OF VARIABLE NR 2 

spring <== (3b) 

ENTER NAME OF VARIABLE NR 3 

summer <-= (3c) 

ENTER NAME OF VARIABLE NR 4 

fall <== (3d) 



c. Enter Data Elements 



Next, enter the data when prompted. Enter either inte- 
gers or real numbers, i. e. , with or without decimal points; 
the program will read them as reals. Leave a space (not a 
comma) in between numbers.. Enter all elements of a variable 
on one line if possible; if not, there will be a chance to 
add more elements after the "ENTER" key is pressed. There 
will also be a chance to check or correct entries. In the 
following example, a mistake is made, rhen corrected, in 
entry of variable one. In variable two the data is checked 
and found correct; in variable three it is not checked. In 
variable four, the first four data elements are listed, the 
"ENTER" key is pressed, the last three data elements are 
added, and the data checked. 



(1) Enter, Check, and Correct the Values of 
Variable One: 



WHEN ENTERING DATA ELEMENTS <1, START WITH 0. 

ENTER VALUES FOR VARIABLE 1 

92 9 98 19 21 58 43 <== 

ADD MORE ELEMS TO SAME VARIABLE? 

OR CHECK THIS VAR? OR GO TO NEXT? 

ENTER A (ADD),C (CHECK)OR G(GO ON) 

C <= = 

NUMBER OF ELEMS ENTERED 7 

NUMBER OF ELEMS IN THIS VAR 7 

92.000 9.000 98.000 19.000 21.000 58.000 43.000 
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(The last entry should have been '42' vice '43', 
so re-enter variable:) 

WANT TO EE-ENTER THIS VARIABLE? 

ENTER Y_CR N 

enIer values for variable 1 

92 9 98 19 21 58 42 
ADD MORE ELEMS TO SAME VARIABLE? 

OR CHECK THIS VAR? OR GO TO NEXT? 

ENTER A (ADD), C (CHECK)OR G ( GO ON) 

C <=== 

NUMBER OF ELEMS ENTERED 7 

NUMBER OF ELEMS IN THIS VAR 7 

92.000 9.000 98.000 19.000 21.000 58.000 42.000 

WANT TO RE-ENTER THIS VARIABLE? 

ENTER Y OR N 
n <== 



(2) Enter, Check Values for Second Variable: 



ENTER VALUES FOR VARIABLE 2 

1 12 1 1 109 26 22 71 49 <== 

ADD MORE ELEMS TO SAME VARIABLE? 

OR CHECK THIS VAR? OR GO TO NEXT? 

ENTER A (ADD) ,C (CHECK)OR G(GO ON) 

C <== 

NUMBER OF ELEMS ENTERED 14 

NUMBER OF ELEMS IN THIS VAR 7 

112.000 11.000 109.000 26.000 22.000 71.000 49.000 

(This time, it is right the firsr time, 
so no re_entry:) 



WANT TO RE-ENTER THIS VARIABLE? 
ENTER Y OR N 
n <== 



(3) Enter Values for Third Variable: 

ENTER VALUES FOR VARIABLE 3 

99 10 92 19 23 51 44 <= = 

ADD MORE ELEMS TO SAME VARIABLE? 

OR CHECK THIS VAR? OR GO TO NEXT? 

ENTER A (ADD) , C (CHECK)OR G ( GO ON) 
g<== 

(4) Enter, Add to. Check Values for Fourth 

Variable: 



ENTER VALUES^FOR VARIABLE 4 

77 12 €1 18 <== 

ADD MORE ELEMS TO SAME VARIABLE? 

OR CHECK THIS VAR? OR GO TO NEXT? 

ENTER A (ADD) ,C (CHECK) OR G ( GO ON) 
a <== 

ENTER VALDES FOR VARIABLE 4 

24 62 41 <= = 

ADD MORE ELEMS TO SAME VARIABLE? 

OR CHECK THIS VAR? OR GO TO NEXT? 

ENTER A (ADD), C(CHECK)OR G(GO ON) 

NUMBER OF ELEMS ENTERED 23 

NUMBER OF ELEMS IN THIS VAR 7 

77.000 12.000 81.000 18.000 24.000 62.000 

WANT TO RE-ENTER THIS VARIABLE? 



41.000 
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ENTER Y OR N 
n <== 

(5) Lochinvar offers a chance zo re-enter the 
entire data set 

YOU HAVE ENTERED 4 VARIABLES, 

WITH THE FOLLOWING NUMBER OF ELEMENTS IN EACH 
VAR NR NR OF ELEMENTS 

1 7 

2 7 

3 7 

4 7 

DO YOU WISH TO RE-ENTER THE ENTIRE DATA? 

ENTER Y OR N 
n <= = 

(6) Lochinvar Determines the Variables are of 
equal size: 



THE VARIABLES/TREATMENTS ARE OF EQUAL SIZE: 

PAIRED COMPARISON TESTS ARE POSSIBLE 

(7) Choice to do a Test with this DATA 

NOW, DO A TEST WITH THIS DATA SET? 

ENTER Y OR N 
y <== 

2* C hoice of Test, Test Parame te rs , Output File 
a. Choice of Output Files 

The relevant choices are 'U LISTING’, ‘V 
LISTING' z LISTING'. Anything char is already in the 
file will be writren over. 

WHICH NAME FOR OUTPUT FILE, U-Z? 
u < = = 

b. Choice of Tesr 

Lochinvar will offer test according to the number 
of variables in the data set and whether the variables are 
of equal length. Test number 17 is desired; rhe Friedman 
rather than the Durbin is chosen. 



PICK AMONG THE FOLLOWING N ON- PAR AMET RIC TESTS: 

1. BINOMIAL 

2. QUANTILE 

3. COX-STUART TEST FOR TREND <2 VARS,PAIR> 

4. MANN-WHITNEY 
5. SIGN <PAIR> 

6. MCNEMAR <PAIR> 

7. R*C CONTINGENCY TABLE (CH I-SQU ARE) <?AIR> 

8. GOODNESS OF FIT, N>14 ?CHI-SQUAR3) 

9. WILCOXON SIGNED RANK <?AIR> 

10. RANK CORRELATION (KENDALL S SPEARMAN) <PAIR> 

11 . NON-PAEAMETR IC LINEAR REGRESSION <?AIR> 

12. NON-PARAMETRIC MONOTONE REGRESSION <PAIR> 
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13. MEDIAN 

14. KRUSKAL/WALLIS 

15. SQUARED RANK TEST FOR EQUAL VARIANCE 

16. HARTLEY TEST FOR EQUAL VARIANCE 

17. FREIDMAN OH DURBIN <BLOCKED> 

18. QUADE <BLOCKED> 

19. COCHRAN <BLOCKED> 

20. KOIMOGOROV(ONE S AMPLE : PA RAM ETERS SPECIFIED 

21. LILLIEFORS (ONE S AMPLE; M ORMAL OR EXPONENTIAL) 

22. WILKES-SHAPIRO TEST FOR NORMALITY 
23. SMIRNOV (2-SAMPLE) 

24. CRAMER-VON MISES (2-SAMPLE) 

DO YOU WANT TO SEE THE LIST AGAIN? 

ENTER Y OR N 
n < = = 

ENTER THE NUMBER OF A TEST LISTED ABOVE 
17 <== 

THIS WILL ALLOW THE FRIEDMAN OR THE DURBIN TEST 
<DURBIN IS INCOHPETE BLOCK DESIGN: NEEDS FILLER> 

DO YOU WANT TO PERFORM THE FRIEDMAN TEST? 

ENTER Y OR N 

y <== 

c. Choice of Test Parameters 

For this example, only the first, second and 
fourth variables are used in the test, and the third block 
of the data set (i.e., the third element of each variable) 
is excluded from the test. The final test parameter entered 
is the level of significance. 



(1) Variables Included in Test 

WHEN ENTERING PARAMETERS <1, START WITH 0. 
DO YOU WANT ALL 4 INCLUDED IN TEST 
ENTER Y OR N 
n <== 

HOW MANY VARIABLES ARE INCLUDED IN TEST? 

3 <== 

ENTER INDICES OF VARIABLES TO BE IN TEST 
ENTER INDEX OF INCLUDED VARIABLE 1 
1 <== 

ENTER INDEX OF INCLUDED VARIABLE 2 
2 <== 

ENTER INDEX OF INCLUDED VARIABLE 3 

4 <= = 

(2) Exclude Third Block 

DO YOU WISH TO EXCLUDE ANY BLOCKS FROM TEST 
ENTER Y OR N 

Ho8 many BLOCKS DO YOU WISH TO EXCLUDE? 

UP TO 7 BLOCKS 
1 <== 

IN ASCENDING ORDER, ENTER INDICES, 1 - 7 

ENTER INDEX FOR EXCLUDED BLOCK NR 1 
3 <== 

(3) Specify Level of Significance 

ENTER LEVEL OF SIGNIFICANCE, ALPHA 
ENTER NUMBSR>0, IF <1, START WITH 3. 

0.05 



137 



3 , 



Sun Test wi th Prog ram C runc h 
In CMS, type CRUNCH 
• Check O utp ut Fil e 

Crunch will ccmput the test s tatistic ,then display 

THE RESULTS AES LISTED IN FILE 
U LISTING 

To check the .results, type XEDIT 3 LISTING 
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APPENDIX B 

THE PROGRAM LOCHINVAR 



A. Purpose 

This appendix is intended to assist in modifying the 
program Lochinvar, or in implementing it on a computer 
system other than the IBM-3033 at the Naval Postgraduate 
School. The last part of this appendix, general description 
of program operation, may be of interest to a user who is 
experienced in the use of Lcchinvar. 

Section III of the text describes the general use of the 
program, while Sections V through XXVIII give specific 
instructions on responding to Lochinvar's promptings for the 
various tests. Appendices C and D explain in detail 
Lochinvar's two output files, the data file and the options 
file, respectively. 

The source code is contained in Appendix H. 

B. DESCRIPTION OF PROGRAM CHARACTERISTICS 

1 . L angua ge 

Lochinvar was originally written and tested in 
Waterloo Pascal; it is presently in a form compatible with 
the IBM virtual system Pascal compiler, PASCALVS. The 
program's syntax is consistent with the Pascal described by 
Zaks C8ef. 3]. 

2. Input a nd 0 utput Fi les 

In Lochinvar, input or output files are referred to 
by one of nine text definitions. "Qin" refers to terminal 
input; "qour” refers to terminal input; "op" refers to the 
options file, "lochi op". Text "a" through "f" refer to 
data files "lochi a" through "lochi f"; these data files are 
used both as input and output. 
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Prior to using a file for output, Lochinvar rous*: 
open that file by the '‘rewrite'* command. For the data 
files, this occurs in the procedure New_data; for rhe 
options file, it occurs in the procedure File_io. All seven 
of those files are explicitly defined as fixed format, 
logical record length 80. The form of the rewrite command, 
here illustrated for *'lochi a'*, is: 

REWRITE (A, ' NAME = LD CHI . A . A, R E CFM = F , LRECL = 80 ' ) 

The terminal output is opened in the main program, 
using the command " TERMOUT ( QOUT) **. 

Prior to reading a file as input, Lochinvar must 
open that file for reading with the "reset" command. For 
the data files, this occurs in the procedure Check_file. 
The terminal input file must be reset prior to each time 
data is entered from the keyboard. Rather than having reset 
commands scattered throughout rhe program, the procedure Rst 
is invoked. Rst resets the terminal for input with rhe 
command "TERMIN (QIN) ". 

3 • Dimensions of Da^ Structur es 

At the beginning of Lochinvar are six constants. 
Tot_data defines the total number of data elements contained 
in the data set; it is currently set at 400. Tot^vars is 
the maximum number of variables in a data set plus one; it 
is currently set at 16 (for 15 variables) . L^names is the 
length of variable or data set names; it is currently set at 
50. L_r_par, I*_i__par, and L_e_b are the number of real 
parameters, the number of integer parameters, and number of 
excluded blocks, respectively; each is currently set at 50. 

Changing the setting of any one of those numbers 
will change all subsequent dimensioning within the program. 
It is not recommended that L_names be increased beyond 30, 
since this would place names on more than one line (card 
image) of the data or options file. 
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^ • Ty P'^ and V ar iabl e D eclara rijn s 

Below the constant declarations, the type declations 
define arrays, sets and types. Below that are variable 
declarations. All parameters are defined globally. 

5* De scriptio n of Vari ables . Arrays, Set s 

Below the variable declarations, a comment section 
gives brief descriptions of each variable, set or array 
passed as a parameter. 

6* Descri pt ion of Prqc edures and Fu nctions 

Each subprogram elemenz contains a brief comment 
section describing its purpose. 

C. DESCRIPTION OF PROGRAM OPERATION 

The main program offers the user three choices: use a 

previously enter data set to perform a test, enter a new 
data set, or enter cell counts. See Section III of the text 
for a complete explanation of these choices. Depending on 
the user's choice, the main program will call one of three 
master subprograms tc prompt the user through the data and 
parameter entries: 

1 . N ew Da ta 

The master subprogram New_Data prompts entry of a 
new data set and offers the user the chance to perform a 
test with the newly entered data. Steps in this process: 

a. Data Set File Type 

New_data asks the filetype for the data, "a” 
through "f". 

b. Names and Number of Variables 

New_data calls Namer, which asks the data set 
name, the number of variables, and the variable names. 

c. Data Element Values 

If there are two variables, New_data offers the 
choice of entering data pair by pair. If the user chooses to 
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do this, New_data calls E_pairs. Otherwise, the procedure 
E_by_var prompts entry of data variable by variable. 

d. Writing of Data File 

New_data writes the data sen file. 

€. Test Choice 

New_data asks if a test should be performed with 
this data. If yes, File_io and Which_test are called; these 
procedures are described in the next paragraph. 

2. O ld da ta 

If the user indicates than a previously entered data 
set is to be used, the procedure Old_data is called. It 
perforins the following: 

a. Data File Choice 

01d_data asks the user's choice of file type of 
old data, "lochi a" through "lochi f". Old_data calls 
Chech_file, which read and displays the name of the indi- 
cated data sat. If it is the correct data set, it also 
reads the number of variables and number of elements per 
variable. 

b. Output File Choice 

The procedure File^io asks to which output fils 
the test results should be sent. It writes the number of 
the data set file (NIF) and the number of the test results 
file (NOF) to line one of the options file. 

c. Test Selection and Parameter Choice 

01d_data calls the procedure Which_test. This 
procedure offers some or all of the nonpar ametric tests, 
depending on the number of variables in the data set and 
whether the number of elements per variable are equal. Once 
the choice is made, Which_test writes that choice to line 2 
of the opricns file, along with whether the test involves 
cell counts. Which_tesn then calls the procedure for the 
chosen test. 
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however. 



d. Test Variable and Parameter selection 

Each test procedure is somewhat different, 
there are common points. If the test involves 
cells, the test procedure prompts cell related parameters, 
then calls W^cells to write those parameters to line 3 of 
the options fils (see appendix D for details of that file) . 
If the data set contains more variables than the minimum 
needed for the test, Hhich^vec will prompt the user to enter 
which variables are included in the test. If the test 
involves paired or blocked data, the procedure Exclud_b 
offers the user the chance to exclude pairs or blocks from 
the test; in other cases, None^ex writes a "0" to line 5 of 
the options file. The test procedure then prompts entry of 
various integer and real parameters, and calls the procedure 
W_param to write those parameters to lines 6 and 7 of the 
options file. 

3 . C el te st 

If the user specified to enter cell counts, the 
procedure Cel_test is called by the main program. It calls 
File_io to write the NIF and NOF to the data file. It calls 
Namer to get the data set and variable names. The choice 
test dictates which procedure prompts the cell count entry. 
Cel_tesr writes the data set name and variable names to line 
8 of the options file. 
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APPENDIX C 
THE DATA FILE 



A. PURPOSE 

This appendix explains the structure of the data files, 
which contain the data set and variable names, rhe number of 
variables and number of elements per variable, and the data 
elements of the variables. This will aid the user in under- 
standing the computer package. More importantly, it will 
allow the user to test data generated by another program, or 
read from cards or tape, without having to load the data 
into Lochinvar by hand. 

B. METHOD OF WRITING DATA FILES 

1. Fo rma ts of Data File C ompo nents 

All names are written in character fields of length 
48, from the first to the forty-eighth position on a line. 
All integers are written in a 1615 format, i. e. , with the 
least significant digit of the integer on position 
5, 10,.. .,80. All data elements are read as real numbers; 
they are written to be read in an 8F10.3 format. This means 
that there are eight fields for real numbers per line, 
1- 1 0 , 1 1-20, . .. , 7 1-80 , and the real numbers have three digits 
to the right of the decimal point, e.g., 789.123. Lochinvar 
includes a decimal point in all real numbers; if this is not 
possible, the least significant digit of an entry (the 0.001 
digit) must be right justified in the field, i.e., aligned 
on column 1 0,20 , . . . , 80. 

If the user has a data set whose data elements are 
all integers, there is a way to have the data file written 
without oonverting all the integers to real numbers. The 
least significant digit of the integers can be aligned on 
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columns 7,17,... 
as real numbers 
data . 



,77, and the program 
with the same value 



Crunch will 
as original 



sad them 
integer 



If the data elements of the data set are either all 
very large or all very small, they may not fit into the 
F10.3 format. A quick fix to that problem is to rescale the 
values, e.g., multiply very small numbers by 1000, or divide 
large numbers by that amount. The validity of the statis- 
tics will not be effected by that transformation. 

If there are both very large and very small numbers 
in the data set, there will be numerical errors in single 
precision Fortran for any tests which use an interval level 
of measurement. For tests which use ordinal or nominal 
level of measurement, any monotonic transformation, such as 
y=ln X, could be used to allow both very large and very 
small data values to fit into the F10.3 format. 

2. L ayout of t he Da^ Fil e 
a. Data Set Name 

The data set name occupies the first line of the 
data file. The name is from 1 to 49 characters long. On 
the same line as the name, in the forty-ninth position, 
there is the character "!"» which on IBM keyboards is the 
upper case "1". For example, a data set file name: 



This is a data set name 



b. Number of Variables and Elements per Variable 

On the line following the data set name, there is 
a line of integers. The first integer is the number of 
variables in the data set. The second integer is the number 
of elements in the first variable; the third integer is the 
number of elements in the second variable, and so on. There 
must be non-zero ' integers in the first two positions. 
Lochinvar writes zero where there are no values, bur that is 
not necessary. As mentioned above, integers are right 
justified on position 5, 10, *..,80. 
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Let the data set consist of two variables. The 
first variable has two data elements, the second has four, 
the first two lines would appear as: 



This is a data set name 
2 2 4 0 0 



0 . . 



c. Data elements 

The data elements are listed variable by variable, 
i.e., all the elements of the first variable are listed, 
then all of the second, et cetera. As mentioned above, the 
data elements are read as real numbers in the F10.3 format. 

Let the elements of the first variable be 27.5 and 
31; let the elements of the second variable be 13, 17, 25 

and 35. The first three lines of the data file would appear 
as: 



This is 
2 2 
27.500 



data set 
4 0 

31 .000 



name 
0 0 
13. 000 



0 0 0 0 0 .. . 
17.000 25.000 35.000 



The zeros xo the right of the decimal point are 
written by Lochinvar but are not necessary. The number 
could have been written without decimal points, with proper 
alignment, as shown below. However, when numbers are 
written with decimal points, a mistake in alignment is no 
problem; without decimal points, alignment is crucial. 
Using only alignment for data elements: 

This is a data set name I 

2240000000 0... 

275 31 13 17 25 35 

d. Variable names 

The variable names begin on the line directly 
following the last data element line. Each variable name 
occupies a line; there must be the same number of lines for 
variable names as there are number of variables indicated in- 
line two. 
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Let freshman be the name of the first variable and 



senior be the name of the second. 



I 

0 0 0 .. . 

25.000 35.000 

senior 



This is a data set name 
22400000 
27.500 31.000 13.000 17.000 

freshman 



C. LOCATION ON THE USER’S A DISK 



• Def aul t Loca t ion 

As mentioned repeatedly throughout the text, 
the data files are named "lochi a", . . ."lochi f". 
writes files to those six locations; Lochinvar 
easily modified unless the user is experience in 
Pascal. 



by default 
Lochinvar 
will not be 
progra mming 



2. R eadin g Files from other than Default Lo c ation s 
The program Crunch reads ”lochi a" lochi f** based 
on the file definitions in the file "CRUNCH EXEC" and the 
number of input file (NIP) of the options file. "CRUNCH 
EXEC" is explained in Appendix E, while the input file is 
explained in Appendix D. To have Crunch read from a prop- 
erly written data file named anything other than "lochi 
a" lochi f", the user will have to do the following: 

a. Write an options file for the data (see section 
III and Appendix D) , either manually (not recommended) , or 
else using Lochinvar, specifying a data set with the same 
general c la rac t eristics as the test data set. For example, 
if the test data set is named QUEUE DATA, with four vari- 
ables of equal length, tell Lochinvar to run the desired 
test on data set "lochi a", given that "lochi a" has four 
variables of equal lenghth 

b. Since the number of inpur file written in the 
options file is "1", corresponding to the default location 
"lochi a", change the file definition in the file "CRUNCH 
EXEC". The origin file "CRUNCH EXEC", 
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&TRACE 

SERROR SEXIT 99 
FILEDEF 01 DISK LOCHI A 

LOAD CRUNCH (START 

beccmes, 

STR ACE 

SERROR SEXIT 99 

FILEDEF 01 DISK QUEUE DATA 

LOAD CRUNCH (START 

c. Alternate Method 

It would be easier, in the example above, 
rename the current file "lochi a" to some other name, 
rename "QUEUE DATA" to "lochi a". 



to 

and 
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APPENDIX D 
THE OPTIONS FILE 



A. PURPOSE 

This appendix describes the options file. The twenty 
nonparametric tests which require ordinal or interval data 
level of measurement all have options files with similar 
format. The four tests which can handle nominal data have 
slightly idiocy ncratic formats, which contain all the infor- 
mation of other test plus something additional. 

This appendix is included for completeness of documenta- 
tion of this text. It is not recommended that a user 
by-pass Lochinvar to write the options file by hand. If the 
computer package is implemented on an operating system which 
cannot run Pascal, though, that may be necessary. 

B. GENERAL INFORMATION ABOUT THE OPTIONS FILE 

1 . Name 

Lochinvar writes the options file to "lochi op" on the 
user's A disk. 

2 . General F or mat Rule s 

As in the data file, all integers are written in the 
format 1615, all real numbers are written as 8F10.3, all 
names are in 48 character lines. See Appendix C, paragraph 
B. , for all the different ways to have real numbers repre- 
sented for the program Crunch to read. 

3* E xplan ation of the O pt ion s File Format 
a. List of Line Numbers 

There are 14 line numbers: 1, 2, 3, 3a, 4, 4a, 5, 
5a, 6, 6a, 7, 7a, 8, 8a. 
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b. Lins Numbers in the Option File 

Each of the 14 line numbers occupies at least cne 
line (or card image) in the options file, unless “hat line 
is omitted. There is not a blank card for omitted line 
numbers; rather, the previous line is followed by the next. 
For example, if line numbers 3 and 3a are ommited, line 4 
immediately follows line 2. 

The lines with ''a'' in their number may occupy more 
than one line (or card image) in an actual file, because 

they may contain more information than would fit on a single 

80 column card. 

c. Rules for Inclusion of Lines in Options File 

Lines 1 and 2 ar e included in every options file. 

Lines 3 and 3a are omitted unless the test is one 
of the four which handle nominal data. 

Lines 4 and 4a will be included in every test 
except when the user has entered call counts (in one of the 
four tests which use nominal data) . Line 4 is a single 

positive integer value which tells how many values to read 

in line 4a. 

Line 5 equals "0" and line 5a is omitted unless 
the user has excluded blocks of the data set from being 

tested. In that case, line 5 indicates the number of 

elements to be read in line 5a. 

Lines 6 and 7 are included in every test, and 
will contain single integer values. If a value of "0** is in 
one of those lines, then the corresponding *'a” line will be 
omitted. For example, if line 6 has a *'0”, then line 6a 
will be omitted. If the single value in lines 4, 5, 6, and 
7 is net "O”, it must be a positive integer, representing 

the number of elements to be read in the corresponding ''a” 

line. For example, if the single value in line 5 is "3”, 
there will be three elements to read in line 5a. 
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The "a" lines may occupy as many actual lines in 
the data file to contain all the values to be read. 

Lines 8 and 8a are used only when the user has 
entered cell counts for a test involving nominal data, 
d. Content of the Lines 

(1) Line 1: the number of input file (NIF) , and 

number of output file (NOF) , both integers 

(2) Line 2; the number of the requested test 
(NTEST) , and whether the test involves cell counts (CELLS) , 
both integers, CELLS=1 if the test is one of the four which 
can use nominal data. 

(3) Line 3: the cell counter option (TYPCEL) , the 

number of rows in the call matrix (NROW) , the number of 



columns 


in that matrix 


(NCOL) , 


and the 


number of elements 


in 


line 3a 


(NTORD) , all 


posit ive 


integ er 


s. Depending on 


the 


TYPCEL 


option, line 


3a will 


contain 


either integers 


or 


reals. 


(4) Line 4 


: the 


number 


of treatments to 


be 


included 


in this test 


(NTREAT) 


, a positive integer. Line 


4a 



contains the indices of the variables of the data file which 
are included in the data set to be tested. 

(5) Line 5: the number of blocks excluded from the 
test (NEXB) . Line 5a contains the indices of excluded 
blocks. 

(6) Line 6: the number of integer parameters, a 

non-negative integer. If line 6 is greater than zero, line 
6a contains the integer parameters for the test. 

(7) Line 7: the number of real parameters, a non- 

negative integer. If line 7 is greater than zero, line 7a 

contains the real parameters for the test. 

\ 

(8) Line 8: data set name. Line 8a contains the 

names of data set variables. 
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C. EXAMPLE OF OPTIONS FILS FOR ORDINAL OR INTERVAL TEST 

Tabls 5 contains a rap resentative data fila, with the 
line numbers of the options file for the twenty tests which 
require ordinal or interval data. This example is for the 
test to perform the problem contained in Appendix A, i. e. , 
problem 1, page 309 of Conover [ref 1]. In this example, 
three of four variables were used, and one block excluded. 
Line number are enclosed by < >. Comments are enclosed by 
(*“*) ; the comments refer to the names of variables, as 
mentioned in the previous section. 



TABLS 5 
0 ptions 



< 1 > 1 11 

<2> 17 0 

<4> 3 

<4a> 1 2 

<5> 1 

<5a> 3 

< 6 > 0 

<7> 1 

<7a> 0.95 



(*NIF=1 ,NOF=1 1*) 

(♦NTEST=17,CELLS = 0*) 

(*NTREAT=3*) 

4 (*indice3 of treatments*) 
(*NEXB*) 

(♦index of the block excluded*) 
(♦number of integer parameters=0*) 
(♦number of real oarameters=1 ♦) 
(♦real parameter=0. 95*) 
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APPENDIX E 
THE PROGRAM CRUNCH 



A. PURPOSE 

This appendix is intended to assist implementing 
Crunch cn systems ether than the IBM-3033 at the Naval 
Postgraduate School. It will also assist in modifying the 
program, or adapting portions of it for use in other 
programs. 

B. DESCRIPTION OF THE PROGRAM STRUCTURE 

1 . G enera l 

While Lochinvar is designed to be used only as an 
integral unit. Crunch is separable. Therefore, many things 
which are centralized in the main program of Lochinvar--such 
as dimensioning and variable declaration--are done in each 
subroutine of Crunch. Explanations and documentation are 
also at the subprogram level. 

2 . Sub pro gram G r cup s 

Crunch can be thought of as a system of five groups, 
each made up of several subroutines and functions. 

a. Main Program 

The main program, together with the subroutines 
Fname and Vname, form one group; they read the data set and 
variable names, call the correct data formation group, then 
call the correct test subroutine, and remind the user which 
file contains the output. 

b. Data Formation Group for Cali Counts 

If the test involves call count (Binomial, 
Quantile, McNemar, Test for Independence, Cochran) , the data 
formation group Caller is called by the main program. Cellar 
and the subroutines it calls produce che cell counts used as 
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the raw material for calculations in the test subroutine. 
Those cell counts are contained in the vector RCCT, which is 
dimensioned b/ the number of rows NROW and nhe number of 
columns NCOL. 

c. Data Formation Group for Data Values 

For all other tests, the master reader group is 
called. Mstrdr and its subroutines read the specified data 
file and the options file. They produce a list of the data 
elements which are relevant for *he test, i.e., Mstrdr 
includes only the data set variables indicated, and excludes 
any blocks or pairs specified. These data values are 
contained in the vector LIST. LIST is dimensioned by the 
pointer vector LPTR. LPTR (1) is always 1. LPTR(i) is the 
index of the first element of the irh variable in LIST. 
With k variables, L?TR(k+1) is one more than the lasr 

element of LIST. For example, if LIST contains three vari- 
ables, with two, four and seven elemenrs each, LPTR will 
have four elements, (1,3,7,14). In dereferencing variable 
values, the ith variable will have elements LIST (LPTR(i) ) 
through LIST (L PT R(i+ 1) -1 ) . NTREAT is the number of vari- 
ables included in the test. 

d. Utility Subroutines and Functions 

Many subroutines are called by more than one test 
subroutine. These versatile subroutines are grouped 
together . 

e. Test Subroutine Groups 

Each of the 24 tests has a principle subroutine 
called by the main program. In general, these subroutines 
perform the bulk of the computations for the test statistic 
or level of significance. However, some of the computation 
is done by calling other subroutines. Subroutines which are 
called only by one test are listed contiguous to that test's 
subroutine. Subroutines which are called by more than one 
test are listed in the utility subroutine group. 
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Ths test subroutines make use of either LIST or 
RCCT in their computations. 

3 . Lan gua ge 

Crunch is written to be compatible with the IBM 
Fortran H compiler. Only the most rudimentary syntax was 
used. The only loop statements are the do for loops of the 
form, 

DO 1 0 I=LL,UL 

where 10 is a line label for a continue statement., LL is the 
lower limit and OL is the upper limit of the loop. All 
other loops or branching is done with if statements. This 
very primitive, unstructured Fortran should be compatible 
with virtually any computer system. 

4 . I nput/ Output Files 

The IBM file definitions are contained in the file 
"CRUNCH EXEC". All files except 08 (terminal) are implic- 
itly fixed format, logical record length 80. Files 01 
through 05 refer to "lochi a" through "lochi e" ; file 06 is 
"probleA data"; file 07 is "lochi f " ; file 10 is "lochi op"; 
files 11 through 16 are "u listing" through "z listing". 

The number of input file (NIF) and number of ourput 
file (NOF) are listed in line one of the options file, and 
read by Crunch’s main program. Files 0 1 , . . . ,05 , 07 , and 10 
are candidates for NIF; files 11,... ,16 are candidates for 
NO?. File 08 is used only to remind the user of which output 
file contains the test result. File 06 contains system 
error messages, while file 09 contains intermediate computa- 
tions of Crunch. 

The main program reads lines one and two of the 
options fils. Either Cellar or Mstrdr read most of the rest 
of the data and options fils. Fname reads the dara set name 
and writes it to the output file; Vname does the same for 
the variable names. 
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The test subroutines write all of the test results 
files "u listing” through '• z listing”, except the data set 
and variable names. 

5 . Di mensions and Peel a ra ti ons 

At the beginning of the main program, and of each 
subroutine, the variables used are declared and the arrays 
dimensioned. All declarations and dimensioning is explicit. 
Although arrays could be dimensioned by variables in subrou- 
tines, this is not done. For example, with TOTELM as the 
integer representing the total data elements of the test, 
LIST could be dimensioned in a test subroutine as 
LIST (TOTELM) vice LIST (400). This was not done; as 
explained above. Crunch is intended to oe separable. To 
change the capacity of the program, a global change command 
may be issued, if the installation has a good text editor. 
Otherwise, the dimensions must be changed one by one. 

There are a very few arrays which consume inordinate 
amounts of storage fer what information they provide. They 
have the dimension of 20,000; that dimension allows storage 
of a comparison cf each of 400 data elements with each other 
data element. On the IBM- 3033, these large arrays require 
the user to request 1 megabyte of storage. On smaller 
systems, these arrays may be redimensioned down to exactly 
the size necessary (if lc=the number of elements, k(k-1)/2). 
An alternative is to eliminate them altogether, and take out 
test procedures which use them. 

6 • Description of V ar i ables 

At the beginning of the program, there is a brief 
description of all variables and arrays which are used or 
passed by the main program. After the subprogram title 
line, a similar list defines the variables used exclusively 
among subroutines of that subprogram. 
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Above the variable declararions in each test subrou- 
tine, there is a description of each of the integer and real 
parameters used in that rest. 



C. EXTERNAL SUPPORT 



The non-IMSL subroutine Shsort is essential to the oper- 
ation of the program Crunch. Irs documentation is available 
to any user at the Naval Postgraduate School. Any system 
sort routine can be used to support Crunch, which is 
currently configured to pass a real vector of values to be 
sorted, an integer vector (1,...,n) ~o receive the order of 
the original data values, and the number of elements n. 

The non-IMSL Otest was used for the Mann-Mhitney test. 
Its value is that it not only computes the test statistic, 
but also the level of significance. Modifications to 

Crunch could easily allow computation of the test statistic. 
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APPENDIX F 

FORTRAN SOURCE CODE: CRUNCH 



THIS PROGRAM TAKES DATA FROM TEST A- F FILE, INFORMATION 
FRCM TEST OP FILE TC PERFORM NON-P ARAM ETRI C TESTS 

SUBPROGRAM GROUPS 

MAIN CONTAINS THE MAIN PROGRAM, FNAME, VNAME , REMIND 

RE^OS TOP 2 LINES OF OPTIONS FILE, WRITES THE 
TCP OF OUTPUT FILE, CALLS CELLER CP MSTRDR, 

TEST SUBROUTINE; REMINDS WHICH IS CUTPUT FILE 
CELLER CONTAINS CE L LER , COCH ,TC 3, T C 3LN , C ICO T ,TC4 ,TC4LN , 
PAIRST ,MCOUNT. FINDS THE CELL COUNTS FCR FIVE 
NP TESTS: BIN OMI AL ,QUANT I L£ , MCNEMAR, C OCHARAN , 

AND RC CONTINGENCY, CELL COUNTS TO RCCT; 
DIMENSIONED BY NROW AND NCOL 
MSTRCR CONTAINS MSTRDR , SU BL 1ST ,NUL 1ST , 

READS THE DATA FILE NUMBER OF VARIABLES, 4 
ELEMENTS/VARIABLE, DATA ELEMENTS. INCLUDES 
VAR'S CF TEST INTO LIST, EXCLUDED PAIRS OR 
BLOCKS AS REQ. YIELDS LISTIDATA ELMSJ, 
LPTRIPCINTER FOR LIST), NTREAT(i¥ OF VAR IN 
TEST), TOTELM (# ELMS IN LIST) 

UTILITY CONTAINS GENERAL SUPPORT SUBROUTINES/FUNCTIONS 

SUB ROUT INES: EXPF, FINOS,FINOT, MUVAR,NORMF,NORMV, 
PARCQM ,RANCUE , RH OER, RNQ, R PARAM 
TESTS CONTAINS THE 24 NONPARAMETR IC TESTS: USE EITHER 
EITHER LIST OR RCCT; COMPUTE TEST STATISTICS 
OR LEVEL OF SIGNIFICANCE; WRITES TC OUTPUT 
FILE 

THE GRCER IN THE PROGRAM IS THE SAME AS THE 
NUMBER OF THE TEST, NTE ST 

VARS 

NIF NUMBER 1-6 INDICATING FILE TEST A-H FCR DATA 

IHOLD INT V£CT0R(16) GENERAL PURPOSE HOLDER 
NVARS INT NUMBER OF VARS OR TREATMENT IN DATA SET 

NTREAT INT NUMBER OF VAR/TREATS TEST ACTUALLY USES 
TREAT INT VECT0R(16) INDICES OF TREATS OR VARS 
ACTUALLY USED 

NELMS INT VECT0R(16) # ELEMENTS OF DATA IN EACH 
VAR/TREAT; IF SOME BLOCKS EXCLUDED, 

NUMBER REDUCED 

TOTELM INT TOTAL 4 OF DATA ELEMENTS IN DATA SET; 

IF SOME BLOCKS EXCLUDED, NUMBER REDUCED 
NTEST INT INDEX OF NON-PARAMET RIC TEST TO BE USED 
EXB INT VECTOR(200) INDICES OF BLOCKS TO BE 

EXCLUDED ;MAY BE EMPTY 
NEXB INT NUMBER OF ELEMENTS OF EXB 
IP INT VECT0R(50) INTEGER PARAMETERS 

IPl INTEGER VAR FOR CELLER: MAYBE NROW OR SOME 

CTHER VALUE DEPENDING ON TYPE OF CELL 
CEFINIT ION (SEE TYPCEL) 

NIP INT NUMBER OF ELEMENTS IN IP 

RP REAL VECTOR! 10) REAL PARAMETERS 

NRP INT NUMBER OF ELEMENTS IN RP 

MF INPUT FILE INDEX 

NOF CUTPUT FILE INDEX 

LIST REAL VECTCR(400) DATA ELEMENTS 

LPTR INT VECTOR!!?) POINTS TO THE FIRST ELEMENT 

OF EACH VAR OR TREATMENT'S DATA IN LIST; 

THE LAST ELEMENT OF LP TR=T CTEL M,+i 
RNVARS REDUCED NUMBER OF VARIABLES 

RNELMS VECTOF OF REDUCED 4 CF ELEMENTS PER VARIABLE 
OICOP =-l IF DICOTQMIZE BY PARTITION, ELSE=NUMBER 
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CF VALUES OF SMALLER SUBSET CF DATA, 

C IF DATA NOT DICOTOMIZED 

DIVAL REAL FARTITI CiN VALUE OR LIST CF SUBSET VALUES 
NSUCC IF DICOTOMIZED DATA, HOW MANY OF CNE SUBSET 

CELLS IM =1 IF TEST USES CELL COUNTS, ELSE =0 

TYPCEL INTEGER METHOD OF DEFINING CELLS: 

1- DICOT PARTITION, 

2- DICOT LIST,3-R*C PARTITION, 

^-FIXEO ROW/COL 

COUNT, 5-ENUMERATED CELL COUNT 
RCCT RCW,CCL CELL COUNT, BY RCWS (RE FIRST VAR), 

E. G.,ALL COLS OF ROW 1, THEN ALL COLS OF ROW 2 
ROWNR VECTOF OF ROWS TO WHICH 1ST VAR ELcMS BELONG 

COLNR VECTOR OF COLS TG WHICH 2D VAR EL5MS EELQNG 

RORC INTEGER LOGICAL 1=R0W 0=C0L 

NTORD INTEGER # OF VALUES TO READ (W/I CELLER) 

CIV INTEGER VECTOR CELL INPUT VALUES 

CRP real VECTOR CELL INPUT VALUES 

TIEV INTEGER VECTOR OF T I ES :T I £V ( 1 ) . L E ,T I £V( 2 ) .GT ; 

BOTH WITHIN EPSILN OF PARTITION VALUE 



INTEGER NIF,NFILE ,IHQLO( 16) , NVAR S, TO TELM , NTES T , 
ENTREAT, I POO) ,NR P , LP TR ( 17 ) , N ELMS ( 16 ) ,RNELMS( 16) , 
#OICOP,NSUCC,CELLS ,N0F,CIV(50 ) ,NT CRD, RCkC , ROWNR ( 50 ) , 
=i=RCCT (50) , TYPCEL, TIEV (2) ,EXB( 200) , TR£ AT ( 1 6 ) , RN VARS , 
♦REDELM, CCLNR( 50) 

REAL RP( 50 ,L IST(400 ) ,01 VAL( 200) ,CRP(50) 

101 FORMAT! 16 15) 

102 FORMAT(3F10.3 ) 

201 FORMAT! IX, 161 5) 

202 F0RMAT(1X,8F1C.3 ) 

R£AD(10,1C1) MF.NOF 
R£AD(i0,101) NTEST,CELLS 
WRITE(9,2C1) MF,NOF,NT£ST 
TYPCEL=0 

IF ( CELLS. CQ.I) CALL CE LLER( NI F , NOF , NTES T ,NRO W,NCOL , 

* RCCT, LIST, LPTR,NVARS,NTREAT, NELMS, RP, IP, 

« TREAT , TIEV , TYPCEL) 

IF (CELLS. EG. C) CALL MSTROR ( NVAR S ,NE LMS, L 1ST , 

=S'LPTR ,T0TELM,NTREAT,NIP, IP ,NRP,RP , NI F , NOF , TREAT ) 

IF (( TYPCEL. N£ .62 ) .AN D . ( TYPCE L.NE . 72 ) ) 

CALL VNAME (NI F, NOF, NVARS,NTR£AT, TREAT ) 

IF(NTEST.EG.l ) CALL BINTST 

* ( N0F,RCCT,RP,IP,ALPHAT,CILL,C1UL) 
IF(NTEST.EG.2 ) CALL QUANTL ( NCF , R CCT , RP , I P ,ALP )*aT , 

* Cl LL, Cl UL, LIST, LPT R,T IE V, TYPCEL) 

IF(NTEST .EQ.3 ) CALL CQXSTU( L 1ST , LPTR , IP , RP,NOF,NT£ST ) 
IF(NThST.EC.4 ) CALL MANWHT( LPTR , LIST , NOF , RP ) 
IF(NTEST.£Q.5 ) CALL SGNT ( LI ST,LP TR ,NOF , TO TE LM ,RP , I ? , 

« NT EST ) 

IF(NT£ST.EG.6 ) CALL MCN£MR( RCCT , RP , I P , NOF ) 

IF(NTEST .EQ.7 ) CALL RCONT( RC CT, N ROW , NCOL , NOF ) 
IF(NTEST.EG.8 ) CALL GDF IT (L I ST , L PTR , IP ,RP ,N0 F ) 
IF(NTEST.EG.9 ) CALL WILCOX! L 1ST, LPTR , IP , RP, NOF) 
IF(NT£ST .EG.IC) CALL CORREL 

* ( LIST , LPTR, NT BEAT , TRE AT ,N CF, I P ,RP ) 
IF(NTEST .EQ.IDCALL R£GRET( LIST, LPTR , I P , RP , TR EAT ,NOF ) 
IF(NT£ST .£G.12)CALL MCNOFE(LIST, LPTR, IP , RP, TR EAT ,NOF ) 
IF(NTEST,EQ.1E)CALL MEDIA(NTR£AT , LI S T ,LP TR,N I F, I P ,N0 F ) 
IF (NTEST . EQ.1^)CALL KRUWAL 

* (LI ST, LPTR, IP, R P, NOF, NTR EAT , TREAT) 
IF(NTEST.£G. lf)CALL SGUARK 

* (LI ST, LPTR, NTRE AT, TREAT , IP,RF,NOF) 
IF(NTEST.EC.16)CALL HARTLE 

* (Ll S T, LPTR, NTRE AT, TREAT, IP, RP, NOF) 
IF(NTEST .£G.17)CALL FRIEOM 

* (NTR£AT,LPTR,LI ST,IP,RP ,NOF,TOT£LM) 
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IF(NT£ST,EQ.l £>CALL QUDE 

* (NTREAT,LPTRfLlST,NCF, TCT5L.-U 

IF(NTEST.EQ.1S)CALL CCCHBN(NROW, NCOL , RCCT , IP , RP , NOF J 
IF(NTEST.EQ.2CICALL KQLMGG( LIST, LPTR , TRE A T, I P ,RP,NOF ) 
IF(NTEST.EQ.21)CALL L IL I EF( L 1ST , LPTR , TRE A 7, 1 P ,RP ,NUF ) 
IF(NTEST,EH,2 2>CALL ^/ILKSHILIST, LPTR , TRE AT, I P ,RP , NJF i 
IF (NTEST. E3. 2 2)CALL SMI RM2( L I ST , LPTR , TRE A T, I P ,RP ,NOF ) 
IF(NTEST.E(;.2 4)CALL CRVONMIL 1ST, LPTR , TREAT, IP ,RP ,;iOF ) 
CALL REMINGINCFi 
STOP 

END _ __ 

SUBROUTINE REMND(NOF) 

INTEGER NCF 
k»RITEt8,lCiJ 

101 FORMAT! IX,' TFE RESULTS ARE LISTED IN FILEM 
IF (NOF.EC.ll ) WRITE (3,11) 

11 FORMAT! !>,’ U LISTING' ) 

IF (N0F.EC.12 ) WRITE !8, 12) 

12 FORMATUX,' V LISTING') 

IF (N0F.EC.13) WRITE!8,13) 

13 FORM AT! IX,' W LISTING*) 

IF ( NOF. EC. 14 ) WRITE !8,14) 

14 FORMATdX,' X LISTING') 

IF ! NOF. EC .15 ) WRITE (8, 15) 

15 FORMATdX,' Y LISTING') 

IF (NOF. EC. 16 ) WRITE (8, 16) 

16 FORMATdX,' Z LISTING') 

RETURN 

END 



SUBROUTINE VNAME!MF , NOF , NVA RS , NTREAT ,TR E AT ) 

C THIS READS EACH VARIABLE NAME, AND IF THE VARIABLE 
C IS IN THE TEST, WFITE THE NAME INTO THE OUTPUT FILE 
INTEGER N IF, NCF, NTREAT, TREAT ! 16) ,NVARS,I , J 
J=1 

IF ! NVARS.5Q. NTREAT) WRITE!NCF,402 ) NVARS 

402 FORM AT( IX , 'ALL', 15, 

* ' variables CF data set used in THIS TEST') 

IF ( NVARS. N£. NTREAT) WRI T E( NCF , 4 03 )* NVARS , NTREAT 

403 FORMAT! IX, 'OF', 15,' VARIABLES IN DATA SE T , ' , / , 5X , 1 5 , 

« » LScD IN THI S TEST* ) 

CO 10 I=1,NVAFS 
READ (NIF,20C) 

IF ( I.NE.TRE AT! J ) ) GO TO 10 
WR ITE( NCF,4C1) TREAT(J) 

^01 FORMATdX,' THE NAME OF VARIABLE/TREATMENT )^',I4) 

J= J+1 

WRITE! NCF, 2 CO) 

10 CONTINUE 
200 FORMAT ! IX, 

^SOHAAAABeeaCCCCDDCDE EEEFFFFGGGGOOOOI I 1 1 J J JJKKKKLLLLM M ) 
99 CONTINUE 
RETURN 
END 

SUBROUTINE FNAME !RORW,IF,OF) 

C THIS READS TFE FILE NAME FROM FILE "IF", IF RORW IS 1, 

C AND WRITES THE FILENAME TO OF IF RORW IS 2 
INTEGER RCRW, IF, OF 
IF ( RCRW.GT.l ) GO TO 10 
READ ( I F,20C) 

GO TO 99 

10 WRITE ( OF ,122 ) 

122 FORiMAT (lOX, » NAME OF DATA SET:') 

WRITE ! OF, 200 ) 

2C0 FORMAT (IX, 
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« 4 SHAAAABEeBCCCCDDDDEEEEFFFFGGGGOOOOIIII JJJJKKKKLLLL i 
9 9 CONTINUE 
FETURN 
END 

r**#* # ;} :ji * :«c 4 ♦♦ =it =)< 

SUBRCUT INE CELLER(NI F,NOF,NTEST» NROW ♦ NCOL ,RCCT, 

*LIST ,LPTRfNVAFSf N7RE AT » NELMS »RP, IP, TR EAT , 71 £V ,TY PCEL ) 
THIS SUBPROGRAM CROUP FINOS THE CELL COUNTS FOR CNE OF 
FIVE NCNPARAMETRI C TEST — BINOMIAL, QUANTILE, COCHARAN, 
MCNEMAR, RC CCNTINGECY (CHISQAURE TEST FOR INDEP»— AND 
PLACES THOSE CELL COUNTS INTO THE VECTOR RCC T , WHICH IS 
DIMENSIONED BY NRCW AND NCOL* UNLESS THE USER HAS 
ENTERED THE CELL COUNTS, THE SUBROUTINE CALLS SUBROUTINE 
MSTRCR TO REAC THE DATA SET INDICATED BY NIF. THE DATA 
VALUES ARE IN LIST, WHICH IS OIMENSIGNEO BY LPTR. 

SUBROUT INES : 

DICOT PERFORMS THE C IC OTOMI ZAT ION FOR THE BINOMIAL AND 
QUANTILE, GETTING # SUCCES S=RCCT(1) AND # FAIL 
=RCCT(2) TYPC£L=2 IF BY LIST, =1 IF BY PARTVAL 
COCH GETS THE ELOCK AND TREATMENT SUMS OF SUCCESSES 
BY PARTVAL; BLOCK SUCCESSES INTO RCCTil 
RCCT(B); TREAT SUCCESS INTO RC CT ( B + U , . , 

RCCT(B+T); TYPCEL= 

MCOUNT GETS THE FOUR CELL COUNTS FOR THE MCNEMAR TEST. 

RCCT(l) IS SUCCESS/SUCCESS, RCCT(2) SUCCESS/ 
FAILURE, ETC. TYPCEL= 

TC3 DIVIDES PAIRS INTO CELLS OF AN RC CONTINGENCY 

TABLE, EASED ON PARTITION VALUES. RCCT(l) 

IS FIRST ROW FIRST COL, RCCTI2J IS FIRST ROW 
ScCCND COL, ETC. TYPC£L=3. CALLS PAIRST, 

TC2LN, AND RCC TR 

PAIRST SORTS THE VALUES OF THE TWO VARIABLES, ANC KEEPS 
AN INDE> KEY ON EACH, IN ORDER TO RE-PAIR 
THE PAIRS; K£Yl=ROW, KEY2=COL 
TC3LN called BY TC3. ASSIGNS AN INTEGER CORR ES PONDING 
TO THE ROW IN WHICH AN ELEMENT OF THE FIRST 
VARIABLE FALLS, OR THE COL IN WHICH AN ELEMENT 
OF SECQ^D VARIABLE FALLS, BASED ON PARTITION 
VALLES CONTAINED IN VECTOR CRV. INTEGER VECTOR 
RORCNR CONTAINS THE INDICES OF ROW OR COL 
ASSIGNED FOR EACH VALUE 

RCCTR TAKES THE RORCNR VECTORS FOUND BY TC3LN OR TC4LN 
ANC FORMS CELL COUNTS 

TC4 DIVIDE THE DATA PAIRS INTO THE CELLS CF AN RC 

CONTINGENCY TABLE BASED ON FIXED ROW AND COL 
TOTALS. TYPCEL=4. CALLS PAIRST, TC4LN, AND 
RCCTR 

TC4LN ASSIGNS A VALUE TO A ROW OR COLUMN, BASED ON 

FIXED RCW OR COL TOTALS; THE VECTOR OF ASSIGN 
IS RORCNR 

INTEGER NIF,NFIL£,IHCLD(16) , NVAR S, TOT ELM ,NTEST, 
ENTREAT, I P( 50 J ,NRP , LP TR( 1 7 ) , N ELMS < 16 i ,RNELMS( 16) , 

♦ DICOP,NSUCC,CELLS ,N0F,CIV(50 ),NTORD,RGRC ,ROWNR(50) , 
*RCCT (50) ,TYPCEL,KEY1 ,KEY2,K,KK,I , J , J J , IP 1 , NROW, NCOL , 
*EXB( 200) ,TREAT(16 ),RNVARS,R£DELM ,COLNR( 50 ) , T , LL ,UL, 
*7IEV(2) 

IST(400 ) ,0IVAL(200) ,CRV (50) ,VAL1 (200), 



TYPCEL, IPl ,NCOL,NTORD 



) 



REAL RP(5C), 

*VAL2 (200 ) 

READ (10, ICl) 

101 FORMAT(16I5) 

102 FORMAT( 3F10.3 

201 FORMAT! i X ,icl 5) 

202 FORM AT( 1X,8F1C.3) 

NR0W=1 

IF ( TYPCEL. GT. 2) NR0W=IP1 
WRITE (9,209) TY PCEL , NROW , NCOL, N TORO 
209 FORM AT ( 1 X , * TY FECE L ,NRCW , NCOL, NT CRD, * 



p4I5 ) 
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T=TYPCEL 

IF ( (T. NE.5) ./5N0. (T.NE.61J.AND. ( T.NE.62) .ANO. 

* (T.N E.71 ) .AND .(T *NE. 72) ) GO TO 10 

REAO(iC,iCl) ( RCCTa ) ,1 = 1 ,NT0RD) 

IF ( ( T.EQ.^2) .OR. ( T.EQ.72.) ) 

* CALL F^AME (1, NIF, NCF ) 

LIST! 1)=0. C 

LPTR( 1 ) = i 
N VARS=IP1 
NTREAT=IP1 

IF(T.EC.71) NTREAT=NCOL 
IF(T,EC.71i NVARS=NCOL 
DO 5 KK = 1,MREA7 
TREAT(KK)=KK 
5 CONTINLE 

N5LMS( 1 ) = 0 
NTREAT=NCCL 
REAO(10,lCiJ MRETE 
l^RITE (9,401) iNTRETE 

401 FORMAT! 1 X ,•NU^BER OF TR EATMENTS • , 15 , * INDICES*) 

RE AO (10, 101) (TREAT! 1 ), 1=1, N TRET E) 

WRITE (9, 201) (TREAT! I), 1 = 1,NTRETE) 

REAO(iO,lCl) ^EX6 
WRITE (9,202) NEXB 
C302 FORMAT! IX, ‘NUMBER OF EX BLOCKS*, 15,* INDICES*) 

C IF (NEX8.GE.1) READ! 10, 101) ( EXB ( 1 ) , I =1 , NEXB ) 

C IF ( NEX6.CE.1 ) WRITE (9, 201) ( EXB ( I ) , 1 = 1 , NEXB ) 

CALL RFARAN(NIP,NRP,IP,RP) 

IF ( ( T.NE.d2) .AND.(T.NE.72.) ) 

* CALL FNAME! 1,10,N0F) 

CALL F^AME(2,9,9) 

CALL FNAME(2, 10,N0F) 

GC TO 99 

10 IF ( (TYPCEL.G7.3 ) .AND. ( TYPCEL.lt. 6) ) GO TC 12 
READ(10,102) (CRV ( I) , I=1,NT0R0) 

WRITE(9 ,202) (CRV( I) ,I=1,NT0RD) 

GO TO 14 

12 FEA0(10,lCi) (CIV! I) , 1=1 ,NTORD) 

WRITE(9,2C1) (Cl V(I) ,1 = 1 ,NTORD) 

14 CALL MSTPCB(NVARS, NELMS, LIST ,LPTR,TOTELM ,NTR£AT, NIP, 

* IP ,NRP ,PP,MF,NCF, TREAT) 

IF(TYPC£L.LE. 2) CALL DI C CT( NOF , N TORO , TYP CEL , I FI , CRV , 

* L 1ST, LFTR, TOTELM, RCCT,TIEV) 

IF (T YPC5L.EQ.2) CALL TC3 ( NTREAT , LIST , LPT R ,CRV ,NTORO, 

* IP1,NC0L, RCCT ) 

IF(TYPCEL.cQ.4) CALL TC4 ( NTR EAT , L 1ST , LPTR ,C I V , 

* NTORC ,IP 1,NCCL,RCCT) 

IF(T YPCEL.cQ.63) CALL MCOUNT ( LIS T , LP TR ,C R V, RCCT , 

* TREAT) 

IF(TYPC£L,EQ. 73) CALL CCCH 

* (NTREAT, NCF, L 1ST , LP TR, R P ,NTORO, CRV , NROw , NCOL , RCCT ) 
IF(TYPCEL.EQ. 73) GO TO 99 

WRIT E(9, 3C1 ) 

C WRITE(NOF ,301 ) 

301 FORMATdX,' CELL MATRIX:') 

CO 80 I=1,NRCW 

LL=l + ( I-1)=»NC0L 
UL=I *NCGL 

WRITE (9,2C1) (RCCT(J),J = LL,UL) 

C WRiTE (NOF ,201) ( RCCT ( J) , J=LL ,UL) 

80 CONTINUE 
99 RETURN 
END 

SLBROUTINi COCH 

- (NTREAT ,NCF , LIS T , LP TR , RP ,NTORO , CRV , NROW , NCOL ,RCCT) 
THIS FINOS WHICH DATA ELEMENTS ARE SUCCESSES, 



C- 

C 
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101 



5 



20 

10 



201 

202 



C 



101 



C 



C 



ev COMPARING LIST WI 
FcR ROW £ COL, SENDS 
SUCCESS PER RCWIBLOC 
PER CCL( TREAT ) 

REAL LISK^OO ),RP(50 
INTEGER NTREAT,LPTR( 

* LL ,UL,NT 

N=IFIX(FLCAT(NTQRO)/ 
IF (N.NE.MRE4T) WRI 
FORMAT! IX, 

* • EXPECTED NURBER OF 

* /'OF PAR7ITICN VALU 

« • WHICH DOES NOT AG 

* /*L ISTEC NUMBER CF 
NT=NROW+NCCL 

DO 5 1=1, M 
RCCT ( I ) = 0 
CONTINUE 
CO 10 1 = 1, N 

MULT = CFV ( ! 2*n ) 
PT=CRV( (2«H-1 )-M 
I C=I+NRCW 
LL=LPTR!I) 
UL=LPTR(I-»- I)-l 
DO 20 J=LL,UL 
IR=J-LL+1 
V = LIST(J ) ♦MU 
IF(V.GT.PT) 
FCCT (IR 
FCCTdC 



TH PT, COUNTS THE # OF SUCCESSES 
TOTS INTO RCCT, INDEX 1-NR0W= 
K),NR0W+1 TO NROW+NCOL=SUCCESS 



) . 
17 



,CRV(50) ,PT ,MULT,V 
7) ,NRQW,NCOL,RCCT( 



f(50l,N, IC, IR, 



2) 

T£ (N0F,101) N,NTREAT 

TREATMENTS BASES ON NUMBER* , 
ES WAS* , 14, 

REE WITH THE * , 

TREATMENTS, * ,I4> 



ULT 



LT 

GO TO 20 

)=RCCT(IR)+1 

J=RCCT(IC)+1 



CONT iNLc 
CONTINUE 
NR0WP1=NRCW+1 

WRITE (NCF,201) ( RCCT (I » , 1=1 ,NRO W J 
WRITE (NCF,202) ! RCC T ( J ) , J=NROWP 1 ,NT ) 

FORM ATdX, *ROW T OTAL S : * /201 3 ) 

FORM ATdX,* COLUMN TO TALS : */20I3 ) 

RETURN 

END 



SUBROUTINE TCS 

♦ (NTREAT,LIST,LPTR,CRV,NTORO,NROW,NCCL,RCCT J 
INTEGER NTREAT,LPTR( 17) , NTORD ,NRCW , RCCT ( 5 0) ,CIV(50) , 

♦ KEY2 (200 ),NPA IR, RCWN R (5 0 ) ,COLNR( 50 ) , NCOL , KE Y1 (200) 
REAL LI ST (400 ),VAL1( 200) , VAL2 (20 0 ) , CRV ( 5 0 ) 

WRITE(9, ICl) 

FORMATdX, *INTO TC3* ) 

CALL PAIRSKVALl , VAL 2 ,KEY1 , KEY2 , LIST , LPT R ,NP AIR) 

CALL TC3LN(1 , CRV , VAL 1 ,KEY1, N PAI R ,NROW ,NCCL,ROWNR) 
CALL TC3LN (0 , CRV , VAL 2 ,K EY2, N PAIR , NRO W ,NCCL, COLNR ) 
CALL RCCTR(NRCW,NCOL ,NPAIR,RCWNR , COLNR, RCCT) 

RETURN 

END 



SUBROUTINE TCA 

♦ (NTREAT ,LIST ,LPTR ,C I V , NTORD ,NROW ,NCCL ,RCCT ) 

INTEGER NTREAT, LPTR( 17) , NTORD ,NR OW ,NCCL , RCCT ( 50 ) , 
*KEY2 (20 0) ,NPA IR, RCWNR(5 0) ,COLNR( 50 ) , C IV ( 50 , K EY 1 ( 200 ) 
REAL LI ST (400) ,VAL1( 200) ♦ VAL 2 (200) , CRV (50) 

CALL PAIRST(VALi , VAL2,KEY1,KEY2, LI ST , LPTR ,NP A IR ) 

CALL TC4LN(1,CIV,KEY1,NPAIR,NR0W ,NCOL,ROWNR) 

CALL TC4LN(0, CIV ,KEY2,NPAIR ,NROW , NCOL, COLNR) 

CALL RCCTR(NRCW,NCOL ,NPAIR,ROWNR , COLNR, RCCT) 

RETURN 

END 

SUBROUTINE PAIRST 
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101 



10 

305 

302 

301 



C 

Cl 01 



10 

20 

203 



12 
2 04 

99 

C 



♦ (VALl, V;iL2fKEYl ,KEY2,LIST,LPTR, NPAIR) 

INTEGER N7R£AT,LPTR( 17) ,NT0 RD tNR OW ,NCCL , RCCT ( 50) ♦ 

♦ KEY2 (200 ) »NPA IR, RCWN R (50 ) »COLNR( 50), CIV ( 50 , 
*NRCELM,NTCP,I CIV tICRV ,NIN,I , J, JJ ,K, RCRC, K£Y1 ( 200 ) 

REAL LIST(400 ),VAL1( 200 ) ,VAL2(200) ,CRV(50) 

WRITE(9 ,101) 

FORMAT! IX, 'INTO PAIRST') 

NPAI R=LPTF(2)-LPTR(1 ) 

CO 10 I=1,^PAIR 
J=I+NPAIR 
V ALl ( I ) = LIST( ) 

VAL2( I )=LI 'T( ) 

KEYl ( I ) = I 
KEY2 I)=I 
CONT INUE 
URITE (9,335) 

FORMAT (IX,' VALUES OF VAL1,VAL2 BEFORE SCRT •) 
URIT£(9,301) (VALKI ),I = i,NPAIR) 

WRITE(9,30i) (VAL2(I ) ,1=1 .NPAIRI 
CALL SHSCRT(VAL1 ,KEY1,NPAIR) 

CALL ShSCFT(VAL2 ,KEY2,NPAIR) 
k*RITE(9,3C2) 

FORMAT (IX,* VALUES OF VAL1,VAL2 AFTER SCRT •) 
WRITE(9,30i) (VALKI ),I = 1,NPAIR) 

URITE(9,3CK (VAL2U ) ,I=1,NPAIR) 

F0RMAT(7F10.3 ) 

RETURN 

END __ __ 

SUBROUTINE TC3LN 

♦ (RORC,CRV ,VAL,KEY,NPAIR,NROW,NCOL,RCRCNR) 

INTEGER NTREA1,LPTR( 17) ,NTQRD,NR OW,NCCL, RCCTC 50) , 

♦ KEY2 (200) ,NPAIR,RCWNR(50) ,COLNR( 50 ) , C IV ( 50) , I K, 
*NRCELM,NTCP, ICIV , ICR V ,N I N ,I , J , J J ,K,RCRC , RCRCNR( 50) , 

♦ KEYl (200 ) ,K£Y (200 ) 

REAL LI ST(4G0 ) ,VAL(200) ,CRV( 50) 

WRITE (9,131)NPAIR 

FORMATdX,' INTO TC3 LN ,NPAIR;',I5) 

IK=1 

IF(RCRC. LT.l ) GO TO 10 
I CRV=1 
K=0 

NTCP=NFCW-1 
GC TO 2C 
CONT INUE 

I CRV=NPCW 
K=NR0W-1 

NT0P=NFCW+NC0L-2 

CONTINUE 

WRITE (9,2C3) IK,ICRV,K,NTOP,VAL(IK) ,CRV(ICRV) 
FORMATdX,' IK, ICR V,K,NTGP:' ,415, 

♦ ' VAL( K) ,CRV(ICRV) • ,2F10,3 J 

IF ( ( ICRV.GT.NTOP ).0R.( VAL( I K ) . L E. CR V ( IC R V) ) ) 

♦ GC TO 12 

I CRV= ICRV+1 
GO TO 20 

RORCNRlKEYdK )) = ICRV-K 

WRITE (9,204) KEY(IK), RGRCN R (KE Y ( I K ) ) 

FORMAT (1X,'KEY( IK)' , 15, ' RORCNR ( KEY ( IK ) ) • , I 5 ) 

I K=IK + 1 

I F ( I K.GT.NPAIR) GO TO 99 
GO TO 20 
RETURN 

END __ _ __ — 

SUBROUTI NE RC CTR ( NRO W ,NCCL, NPAI R , ROW N R ,C C LNR , FCCT ) 
CIM 
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101 

10 

20 
C 

10 

20 

15 

30 

C 
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INTEGER MREAT,LPTR( 17) , NT0RD»NR0l< f NCCL , RCCT ( 50 ) , 
«KEY2 (200 ) fNPA IR, RCWNR(50 ) ,COLNR( 50 ) t C JV( 50) tCELLNR, 
*NRCELM, NTCPtI CIV, ICRV ,NIN,I , J,J J,K,KEY1( 2C0) 

REAL LISKAOO ) ,V ALl ( 2 CO ) , VAL 2( 20 0 ) ,CR V( 5 0 ) 
kRITE (9,101) 

FORMATdX, ' INTO RCCTR*) 

NRCELM=NRCW*NCQL 
DO 10 1=1 ,NRC£LM 
RCCT( I )=0 
CGNT INUE 
CO 2 0 1=1 ,iNPA IR 

CELLNR=(ROV'NR ( D- 1 )*NCOL>CCLN R( I ) 

RCCTi CELLNF)=RCCT (CELLNR)+1 
CONTINUE 
RETURN 
END 



SUBROUTINE TC^LN ( BOR C ,C I V ,KE Y,NP A IR , NROW , NCOL ,RQRCNR ) 
INTEGER NTREAT,LPTR( 17) , NTORO,NR CW , NCCL, fiCCT(50), 
*KEY2 (20 0 ) ,NPA IR, RCWNR(50) ,COLNR( 50) ,CIV( 50 , 
*NRCELM,NTCP, ICIV , ICR V ,N I N , I , J, J J ,K , RCRC, RCRCNR( 50), 
*KEY1 (200) , KEY (200) 

PEAL LIST(AOO ),VAL1( 200),VAL2(200),CRV(50) 
NRCELM=NBCy»*NCOL 
IF ( RORC.LT. 1 ) GO TO 10 
NTCP=NFCW 
I CIV=1 
K=0 

GO TO 20 
NTOP = NROl< + NCOL 
I CIV = NfiCW-H 
K=NROVs 



J=0 

CO 30 I=ICIV,NTOP 
NIN=CIV(I) 

IF (NIN.EQ.O) GO TO 30 
DC 15 JJ = 1 ,NIN 
J = J+1 

RCRCNP(KEY(J ) )=I-K 
C QNT I Mj E 
CONTINUE 
RETURN 
END 



SUBROUTINE DICOT ( NOF ,NTORD, T YPCE L , I P 1 ,CR V ,LI S T ,LPTR , 

* TOTELM,PCCT ,TIEV) 

INTEGER DICOP ,LPTR(1 7) , UL,I , NSUC C ,L , 0 ,KE YK400) , 

* NTORO, TYPCEL,NPAIR ,IP1,RCCT(50 ) ,NFAIL, I ,TIE V(2 ) , 

» KEY2(400) ,TCTELM 

REAL 01 VAL(20C) ,CRV( 200) ,LI S T ( TO TELM ) ,PT ,OIF ,ADIF, 

* EPSILN 
NSUCC=0 
TIEV ( 1) = 0 
TIEV (21=0 
EPSI LN=0.CC01 
NPAI R=LPTP(2)-1 

RRITE ( 9 ,206) (CRVd ) ,I=1,N70RD) 

FORMAT( IX, 'CRV',7F10.3) 

•THIS SECTiCN IF OICOTOMIZE BY PARTITION 

IF ( TYPCEL.EQ.2) GO TO 40 
PT=CRV(i) 

DC 10 I=1,^PAIR 

IF (L IST( I). GT.PT) NSUCC=NSUCC+1 
CIF=AES(LIST (I)-PT) 

IF (0 IF.GT. EPSILN) GO TO 8 

IF (LISTd ).LE.PT) TIEV(1) = TIEV( 1) + 1 
IF (LISTd) .GT.PT) TIEV( 2)=TIEV( 2J + 1 
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41 



42 



50 



2 09 



60 



99 

101 



WRITE (9,207 J L ISTd ) »P T ,NSUCC 
FCRMATdX ,*L IST( I ), PT,NSUCC ,2F10.3, I5J 
CONTINUE 
GC TO 99 

THE FOLLOWING SECTION IF DICOTOMI 2E BY LIST 

CONT INUE 
00 41 I=1,NPA1R 
KEYK n = I 
CONT INUE 
CO 42 I=1,NT0R0 
KEY2( I »=I 
DIVAU I)=CFV( I ) 

CONTINUE 

CALL SHSCPTILIST f KEYlfNPAIR) 

CALL SHSCFT(01VAL,KEY2f NTORO ) 

L=1 

0=1 

IF{( L.GT.NFAIf ) .0R.( C.GT.NTQROn GO TO 99 
OIF=LIST(L)-DIVAL(D) 

ADIF=AeS(OIF) 

WRITE (9»2C9) L IS T (L ) , DI V AL( 0 ) » D I F ,NS UCC 
FORMAT (IX, *LIST,0 I VAL.OIF ,NSUCC* ,3F10.5, I5i 
IF ( ACIF.G7.EPSILN) GO TO 60 
NSLCC=NSUCC+1 
L=L+1 
GO 7C 5C 

IF(DIF.GT*C) D=0+1 
IF(OIF.LT.C) L = L-H 
GG TO 50 

WRITE (9,lCli NSUCC.NPAIR 

__ * ,15 , • OF • ,15) 



FORMAT! IX,' » OF STANDARD SUCCESSES 
IF (IPl.ti.O) N5UCC=NPAIR-NSUCC 
C#=«‘*REFLECT THE 4 OF SUCCESSES IF LIST CONTAINS FAI LURES^^f * 



OR VALUES ABOVE PARTITION ARE FAILURES 
WRITE (9,105) NSUCC.NPAIR 
WRITE (NCF,10f) NSUCCfNPAIR 



I 5) 



105 FORMATdX,' NUMBER OF SUCCESSES *,I5,' OF 
NFAI L=NPAIR-NSUCC 
RCCT (1)=NSUCC 
RCCT (2) =NFA1L 
RETURN 

END _ __ 

SUBROUTInI MCCUNT (LIST, LPTR, CRV , RCCT, TREAT) 

C THIS GETS COUNTS FOR FOUR CELLS OF THE MCNEMAR TEST; 

C 2 SUCCESS, 2 FAILURE, SUC CESS-FAI LURE , FAILURE-SUCCESS 
C FBS=1.0 IF SUCCESS IN FIRST TREATMENT IS BELCW CRV(l), 
C =-1.0 IF SUCCESS IS ABOVE CRV( 1 ) 

REAL LIST(400 ),CRV(5 0), FBS,S8S,FPV,SPV,F ,S 
INTEGER LFTR( 17) ,RCCT(4) ,N,K,I ,LF,LS,TREAT(16) 
N=LPTR(2 )-l 
FBS=CRV( 3) 

S8S=CRV( 4 ) 

FPV=CRV( 1 )«FBS 
SPV=CRV( 2)«SBS 
LF=1+IF IX(CRV (5) ) 

LS=2-IFI X(CRV (5) ) 

CO 5 1=1 , A 
RCCT( I)=0 
5 CONTINUE 

CO 10 I=1,N 
K=1 

F=LI ST(I)«FBS 
S = LIST ( I+N )*SBS 
IF(F.GT.FPV) K=K+LF 
IF(S.GT.SPV) K=K+LS 
RCCT(K)=RCCT(K)+1 
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10 



CQNT INUE 

IF ( CRV( 5 ).LT.0,5 » GO TO 20 
URITE (NCFtlll) TREAT! 1) ,TR£AT(2 ) 

GO TC 30 

20 V^RITE (NCFtllll TREA T(2 ) tTREAT! 1 ) 

111 FORMATdX, 'TRtATEMENTS: */• 6£F0RE',I5,' AFTER' »I5) 

30 RETURN 
END 

SLBRQUT INE MS TRDRCNVARS ♦ NELMS, L I ST , L PTR, TOT ELM, 
NTREAT,NIP,I F ,NRP ,R P ,NI F ,NOF , TREAT) 

C This SUBPfiCGRAh GROUP READS ALL Or THE DATA FILE EXCEPT 
C THE NAMES, AND LihES 4-7 OF THE OPTIONS FILE. IT CHOOSES 
C THE DATA ELEMENTS OF THOSE VARIABLES INCLLOED IN THE TEST, 
0 AND PUTS THEM INTO THE VECTOR LIST. IT THEN REMOVES THE 
0 DATA ELEMENTS IF IHE USER INDICATED TO EXCLUDE PAIRS OR 
C BLOCKS. THE INTEGER VECTOR LPTR DIMENSIONS LIST; LPTR(l) 

C = l; LPTRd) = 1ST ELEM CF ITH VAR IN LIST. NTREAT = 

C TREATMENTS/VARS IN TEST; TOTELM, I S TOTAL ELEMS IN LIST 
C LPTR (NTREAT + U = TOTELM+1. NELMS IS INT VECTOR OF ELMS 
C PER VAR AFTER BLOCKS/PAI RS EXCLUDED; TREAT IS INT VECTOR 
C OF INDICES OF ORIGINAL DATA SET WHICH ARE INCLUDED IN TEST 
C IP IS INT VECTOR Cr INT PARAMS; RP IS REAL PARAMS 
C 

C SUBROUTINES 

C MSTRDR READS THE FILES 

C SUBLST PREPARES DATA ELEMENTS FOR REDUCTION 
C NULIST INCLUDES CNLY TREAT SPECIFIED INTO LIST 
C EXBLCC EXCLUDES FAIRS OR BLOCKS AS REQ 

INTEGER NIF,NFILE,IHOLD( 16) , NVAR S , TOTELM , EXB ( 200 ) , 
*NTRE AT, I P (50) ,LPTR ( 1 7 ) , NELMS ( 16 ) ,RNELMS( 16),RNVARS, 
*DICOP,NSLCC,CELLS ,NTEST,TREAT(16 ) ,REDELM,NRP 
REAL RP ( 1C ) ,L 1ST ( 400 ),DIVAL( 200) 

101 F0RMAT(16I5) 

102 FORMAT! 8F1Q. 3 ) 

201 F0RMAT(1X,16I5) 

202 FORMAT!! X ,EF1 C.5 ) 

DIC0P*0 

CALL FNAME (1 ,NIF,NOF) 

CALL FNAM5 (2,NIF,9) 

CALL FNAMS (2,NIF,N0F) 

READ !NI F ,101 ) ( IHOLD! I ) , I =1 , 1 6 ) 

NVARS=IHCLD( 1 ) 

LPTR (1) = 1 
T0TELM=0 
CO 10 I=1,NVARS 
I P1*I+1 

NELMS! I) = IhOLD!IPl) 

LPTR! IP1) = LPTR!I)+NELMS!I ) 

TCTELM=TOT£LM+NELMS( I ) 

10 CONTINUE 

READ !NI F, 102 ) (LIST! I ) ,1 = 1, TOTELM ) 

CALL SUB L ST !L 1ST , LPT R,N VARS, NTREAT, TOTELM , TREAT ) 

CALL RPARAM!N IP,NRP, IP,RP) 

WRITE !9,4Q7) 

407 FORMATdX,* LISTPTR VECTOR AND LIST AFTER REDUCTION') 
WRITE!9,201) (LPTR! I ) ,I =1 ,17 ) 

WRITE (9, 2021 (LIST (I ) ,1=1, TOTELM ) 

WRITE!9 , 202) 

303 FORMATdX,' N VAR S , TOTELM ,NTE ST , N TREA T ,NE X B , N I F ,NRP' ) 
WRITE!9 ,2C1» ^ VARS ,T OTE LM ,NT EST , NTRE AT ,N EXB, N IP, NRP 

RETURN 
^ END 

SUBROUTINE SU ELST ! LI ST, LPTR , NVAR S ,NTREAT , TOTE LM, TREA T ) 
C THIS SUBROUTINE READS THE VALUES FOR THE INCLUDED TREATS, 
C AND EXCLUDED SLOCXS, IF ANY, AND REDUCES THE ORIGINAL 
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C DATA SET OF THE D;TA FILE INTO THE DATA LIST FOR THE TEST 
C SUBROUTINE NULIST INCLUDES TREATMENTS, EXBLOC EXCLUDES 
C TREAT^'E^TS 

REAL LIST(4CQ> 

INTEGER LPTRI 17J , NVA RS, NTRE A T,TO TELM , TRE A T( 1 6 ) ,Nc XB , 
«RNELMS( 16 ♦ ,RNVARS ,EX e (200 j 
FORMAT( 1615) 

F0PMAT(8F1G.3 ) 

FORMAT( IX ,161 5) 

FORM AT( 1X,SF1C.3 ) 

CO 8 1 = 1 ,^VARS 
TREATd ) = 0 
CONT INUE 

PEAD(10,101) ^TREAT 
WRITE (9,201) NTREAT 

FORMAT! IX, ‘NURBER OF TREATME NTS ' , 15 , • INDICES*) 

READ (10,1 Cl) (TREAT! I ) , I =1 , NTREA T ) 

WRITE(9,2C1) (TREAT! I ) , I =1 , N TRE A T ) 

REA0(10,1C1) ^EXB 
WRITE ( 9 ,2C2) NEXB 

FORM AT( IX , 'NUMBER OF EX BLOCKS', 15,' INDICES') 

IF (NEXB.GE.l) R£AD(10,101) ( EXB ( I ), 1=1 , N cXB ) 

IF (NEXB.GE.l) WRITE(9,201) ( EXB ( I ) , I =1 , N EXB ) 

CALL NULIST 

* (NELMS ,T07ELM, LI ST, NTREAT , TR EAT , LPTR , PNVAPS) 

IF (NEXB.GE.l) CALL EXB LOC( L PTR , TOTELM ,L I ST , NTRE AT, 

« NELMS, NEXe, EXB, RNELMS) 

RETURN 
END 



101 

102 

201 

202 



301 



3 02 



C — 



15 

10 



408 



20 

98 






SUBROUTINE NULIST 

« ( NELMS,T0TELM,LIST,NTREAT ,TREAT,LPTR, RNVARS) 

INTEGER MF,NF1LE,IHCL0(16) ,TOTELM,NTEST, EXB( 200 ) , 

« NTRE AT, I P( 50) ,NRP ,LP TR ( 17 ) , N ELMS ( 16 ) ,NULPTR(17) ,KP1, 
4=RNVARS,NCF, 1 1 ,UL,LL, NVARS, TREAT! 16) 

REAL RP ( 1C) ,L 1ST (400 ) 

IF(NVARS. EG. NTREAT) GO TO 98 
CO 5 K=1 , 17 
NULPTR(K)=C 
CONT INUE 
1 = 0 

NULP TR( 1 )=1 
CO 10 11=1, NTREAT 
I IP1 = II + 1 
K = TR£AT(II ) 

KP1=K+1 

LL=LPTR(K) 

UL=LPTR(KP 1)-1 
DC 15 j = LL ,UL 
1 = 1 + 1 

L 1ST! I)=LIST ( J) 

CONTINUE 

NULPTR! I IF1)=LL+1-LL+NULPTR( II) 

CONT INUE 
TOTELM=I 

WRITE(9,AC2) (NULPTR ( I ), 1=1 , 17) 

FORMAT! IX, 'NULPTR* ,/ 1615 ) 

LPTR(1)=1 
CO 20 K=l,17 

LPTR! K )=NULPTR(K) 

CONTINUE 
RNVARS=NTREAT 
RETURN 
END 



SUBROUTINE EXELOC 

* (LPTR, TOTELM, LI ST, NTREAT,NELMS , NEXB , EXB , RNELMS ) 
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INTEGER MF.NFILE ,IH0LD(161 , TOTE LM.NTEST , EXB(200 J ,1, 
=J‘NTRE AT, I P(50) ,NRP ,LPTR( 17), NELMS ( 16 ) , EXI ( 200 ) ,NcXI ,K 
♦ RNELMSI 16) ,TEF1,NCF, LL,UL,NVARS, TREAT (16 ) , EXT FIS, J 
REAL RP( 10) ,L1ST( AOO) 

IF ( NEXB.EC.O ) GO TO 97 
K = 0 

C GET INDICES CF LIST ELEMENTS TO BE EXCLUDED 
LPTR (1)=1 
CO 10 1=1 ,NTREAT 
DO 12 J=1,NEX8 
K = K + 1 

£XI(K)=LPTR( I) + EXB(J)-1 
12 CONTINUE 

10 CONTINUE 
NEXI =K+1 
EXK NEXI ) = 17 
K = 1 
J=0 

TEPi=TOTELM+l 

C RELOAD LIST, EXLUCING ELEMENTS AS INDICATED BY EXI 
00 20 I=1,TEP1 
EXTHIS=£XI (K) 

I F ( I •L7,E>THIS) J=J+1 
IF ( I .LT.E>THIS) LIST( J)=LISTU) 

IF ( I .C-E.E>TH1S ) K=K + 1 
20 CONTINUE 

C CHANGE LIST PC INT SR, NUMB ER OF ELEMENTS 
LPTR (1) =1 

TOTE LM=TCTcLM-N£Xe*NTR£AT 
CO 25 I=1,NTR£AT 

RNELMS(I)=N£LMS(TP£AT(I) ) -NEX B 
2 5 CONTINUE 

NTPl =NTREAT + 1 
DO 30 1=2, NTPl 

LPTR ( I ) = LPTR( I )-( I-l )*NEXB 
30 CONTINUE 

LL=NTREAT+2 
CO 40 K=LL,17 
LPTR( K) =0 
40 CONTINUE 
GO TC 99 

97 DO 98 1 = 1 ,NTREAT 

RNELMS( I) = NELMS(TREAT(I) ) 

98 CONTINUE 

99 RETURN 
END 

C THIS SECTION CONTAINS THE SUBROUTINES CALLED BY MCRE THAN 
C ONE TEST SUBROUTINE. THEY APPEAR IN ALPHABETICAL CRDER: 

C EXPF, FINOS, FINDT, MUVA R , NCRMF, NORMV, PARCCM, RANQUE 
C RHOER, RNQ, RFARAN 

SUBROUTINE EX FF( L 1ST , N, RP ,F , NOF ) 

C THIS SUBROLTINE FINDS THE THEORETICAL EXPONENTIAL 

C OISTIBUTICN F, WHERE L=RP (1 ) =SCA L£ PARAMETER LAMBDA 

REAL A, L ,>,LI ST( 400) ,RP(50) ,F(400) 

INTEGER I 



X=LIST( I) 

IF(X.LE.O) F(I)=0 

IF(X.GT.O) F( I )=1.0-5XP(-(L*X) ) 

10 CONTINUE 

k>RITE(N0F,101 ) L 

101 FORM AT( IX, 'WITH HYPOTHESISED 01 5 TRI 3 UTUI CN *, 

* ' EX PONENTIAL */lX ,'W ITH PARAMETER LAMEDA= ', F1C.4) 
RETURN 



END 



SUBROUTINE F I NOS ( L IS T ,L PTR, S tK ) 

C THIS SUBROUTINE FINOS THE DIFFERENCES BETWEEN TWC 
C EMPIRICAL CDFS FOR THE TWO VECTORS CONTAINED IN LIST 

C 1ST VAR X HAS H ELMS M AND COUNTER I; 20 VAR Y HAS # ELMS 

C N AND COUNTER J; E CDF ARE (I-ii/M AND (J-D/N; STEP 

C COUNTER OF SMALLER VAR UNTIL N+1 OR M+1, THEN INCfi OTHER 

REAL EPSILN, FN,FP,X <100) ,Y( 1001 , 01 F , ADI F ,S ( 2 00 ) , 

* ATMAX,ATMIN,TSUP, POI F , LI ST(40 0) f RP ( 50 ) »TMAX,TMIN 
INTEGER ItJ,K,LfM,N,KEYI(100),KEYJ( 100) f JJf IP(50)t 

* LPTR (17) ,NCF, TREATd? ) ,II 
EPSILN=O.C0Ol 

P=LPTR(2 )-l 
N=LPTR( 3 )-LPTR(2) 

FN=FLQAT (N ) 

FM=FLCAT(f') 

DO 10 1 = 1, M 
KEYI ( I ) = I 
X( I) =LIST( I) 

10 CONTINUE 

CALL SHSCRT(X,KEYI,M ) 

00 12 J=1,N 
KEYJ ( J )=J 
Y ( J) =LIST( (H+ J) ) 

12 CONTINUE 

CALL SHSCRT(Y,KEYJ,N ) 



20 

101 

111 



25 



30 



K=0 

L=0 

1=1 

J=1 

CONT INUS 

IF ( ( I.GT..'*) .^ND. (J.GT.N ) ) L = 1 

FORMAT! IX, 'ITERATION # , I , J, STOP? • ,41 5 , • 0IF:',Fi0.4) 
F0RM^T(1X,'S<K): • ,F10.4) 

II=I 

IF(I.GT.M) II=M 
JJ=J 

IF(J.GT.N) JJ=N 
0IF=X(II )-Y( J J) 

4DIF=ABS (CIF) 

IF(ADIF. GT.EPSILN) GC TO 25 
IF(I.LE.M) 1=1+1 

PDIF=FLCAT (I-1)/FM-FL0A7( J-1) /FN 
K=K+1 
S (K)=PDIF 

WRITE (9,1C1) K,I,J,L,DIF 
IF (K.GT.O) WRITE (9,111) S(K) 

IF(J.LE.N) J=J+1 
K=K+1 
S (K)=PDIF 

WRITE (9,1C1) K,I,J,L,DIF 
IF (K.GT.O) WRITE (9,111) S(K) 

IF(L.EC.l) GO TO 30 
GC TO 20 
CONT INUE 

I F((DIF.LT.O).AND.(I.LE.M)) 1=1 + 1 
I F((DIF.GT.O) .AND. (J.LE.N) ) J=J + 1 
K = K+1 

S (K)=FLCAT (I-l )/FM-FLOAT( J-1 ) /FN 
WRITE (9,1C1) K,I , J, L,DIF 
IF (K.GT.O) WRITE (9,111) S( K ) 

I F(( DIF.LT .0) . AND. (I. GT.M).AND.( J.LE.N)) J=J+1 
I F( ( DIF.GT .0) .ANQ. ( J.GT. N).ANO. (I . LE. M)) 1=1 + 1 
IF(L.LT.l) GO TO 20 
RETURN 
END 
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SUBROUTINE FI ^0T ( S ,F , N, T SUP , TMI N , TMAX ,N0 F J 
THIS FliMCS THE DIFFERENCE IN CDF'S OF THECRETICAL 
DISTRIBUTION F AND EMPIRICAL 01 S T S AT THE X*S DETERMINED 
BY THE VALUES CF THE SAMPLE DATA VARIABLE ELEMENTS. THE 
MAX ABSOLUTE CIFFERENC IS TSUP. T IS VECTOR OF DIFFERENCE: 
REAL S( 400 ,F (40 0) ,TSUP ,ATMIN,ATMAX,TMIN , TMAX ,T(80G) 
INTEGER N,I,II,NOF 
NP1=N+1 
T( 1) =F( 1 ) 

T(2) =F(1 )-S(l ) 

CO 10 11=2, N 
1=11-1 
I T2=( II=>2) 

I T1=I T2-1 
T (ITl ) = F(I I)-S( I) 

T(IT2)=F(II)-S(II I 
10 CONTINUE 
TMIN=2.0 
TMAX=-2. C 
CO 12 J = 1, IT2 

I F(T ( J) .GT .TMAX) TMAX=T(J) 

IF(T( J).LT,TMIN) TMIN=T(J) 

12 CONTINUE 

ATMAX=ABS(TMAX) 

ATMI N=A8S^TMI^) 

TSUP=ATMIM 

IFCATMAX.GT.TSUP ) TSUP=ATMAX 
V<RITE(NOF,101 ) TMAX, TMIN, TSUP 
101 FORMATdX, 'MAXIMUM DIFFERENCE, F ( X ) -S (X ) , ', F 1 C. 4/ IX, 
MINIMUM DIFFERENCE : ',F10.4/1X , 

♦'LARGEST ABSOLUTE 01 FFERENC E : ' , F 10. 4 ) 

RETURN 

END __ __ 

SUBROUTINE MU VAR ( LIS T ,N , MU, V AR) 

: THIS SLBROLTINE F INOS THE SAMPLE MEAN AND VARIANCE 

REAL LIST(400 ),MU,VAR,XBAR,SUMSQ 
INTEGER N 
XBAR=0. 0 
SUMS C=0. C 
DO 10 1=1 ,N 

XEAR = XEAR-»LIST( I) 

10 CONTINUE 

MU=X8AR/FLCAT (N) 

CO 2 0 I = 1,N 

SUMSQ=SLMSC+( LIST ( I )-MU)**2 
20 CONTINUE 

VAR=SUMSC/FLQAT( N-1) 

WRITE (9,101) MU, VAR 
WRITE (NGF.IOI) MU.VAR 

101 FORMATdX, * ESTIMATED MU' ,F10. 3, • VARIANCE ', FIO . 3) 
RETURN 
END 



C — 

C 

C 



SUBROUTINE NO PMF ( LIS T ,N , BP, F ,NOF ) 

THIS SLBROLTINE FINOS THE THEORETICAL NORMAL 
DISTIBUTICN F, WHERE MU = RPd ) , S I GMA SGUAR ED=RP( 2 ) 
REAL PX,MU,SIGSQ,X,LIST(400) ,RP( 50) ,F(400) 

INTEGER I 
MU=R P(1 ) 

SIGSQ=RP(2) 

CO 10 1 = 1, N 
X = LI ST( I) 

Z=(X-MU )/S IGSQ 
CALL NCRMV (1,Z,PX ) 

F ( I) =PX 
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10 CONTINUE 

WRIT E(NOF, 101 ) MUtSIGSQ 

101 FORMATdXf'WiTH HYPOTHESISED CIS TRIBUTUION NQRMAL'/iX, 
*‘WITH PARAMS ^U= • ,F1 0. 4 , ' » AND SIGMA SQUARED= • ,F10.4 ) 
RETURN 
END 



C 

C 

C 

C 



SUBR 
IF OPTIO 
A P, IF 
NUMERICA 
INITIALI 
REAL 
*DELX 
INTE 
FI =3 
CUMP 
CONS 
CLD= 
IF ( 
Z 
U 
D 



OUTINE NOFMV(GPTIGN,ZtP) 

N=1 » THIS TAKES A ZISTANCARD NOR^ALi AND RETURNS 
0PTIQN=2» TAKES A P AND RETURNS A Z 
L INTEGRATIONt TRAPAZGIDALt STEP=0.1 STD 
ZE 6CTH Z AND P IN CALLING SUBROUTINE 
Z,P ,PI,CCNST,OLD,NEW,ZZ,PP» CUMP ,X,OELTP,EXCESP, 



CEV 



104 

10 



12 



50 



60 



1 09 



95 

101 



C-' 

C 

C 

C 



c 

0 

N 

C 

w 

p 

I 

G 

CONT 

P 

X 

I 

X 



I 

E 

Z 

z 

I 

WRIT 

FORM 

RETU 

END 



GER CPTIONtIfJfU 
.1415927 
=0 

T=1.C/(2«FI)**0. 

CONST 

OPTION. EQ. 2) GO 
Z=A8S(Z) 
L=IFIX(ZZ=*10,0i 
0 10 I=liLL 

X=FLQAT( I )*0 
NEW=CCNST=»EX 
CLMP=CUMP+0. 
CLC=NEW 
IF (CLMP.GT. 
WRITE (9,104 
FCRMAT(1X,'I 
GNTINLE 
ELX=22-X 

EW=CCNST*cXP(-0. 
UMP=CLMP+C.5#DEL 
RITE (9,1C4) I,N 
=CUMP+C.5 
F ( Z .LT.O .0) 

0 TO 95 
INUE 
P=P-0.5 
= 0 . 0 

F (P. LT.O. 5) 

= X+0 .1 

NEW=CCNST*EX 

DELTP=0. 05#( 

CLMP=CUMP+DE 

CLD=NEW 

WRITE(9,1C9) 

FCPMATdX,' 

F (CUf'P.LT.PPI G 
XCESP=CUMF-PP 
Z=X-0.1#E>CESP/D 
= ZZ 

F (P. LT.O. 5) 2=- 

E ( 9,1C1 » OPTION 
AT(1X,' OPTION, 
RN 



L,ONE 



TO 50 



.1 

P(-0.5*X*X) 

05*(OLD+N£W) 

0.481 GO TO 12 
J I, NEW, CUMP, X 



•,I5,* NEW,CUMP, XS3F7.3) 



5«ZZ«ZZ) 

X«( OLD+NEW) 
£W,CUMP, X 



P=0 .5-CUMP 



PP=C.5-P 



P(-0.5*X«X* 

CLD+NEW) 

LTP 

X, NEW, CUMP 
X ,NEW,CUMP* , 3F7.3 ) 
C TO 60 



ELTP 

ZZ 

,P,Z 

PROS, 



VA 



L OF Z: * ,15, 2F10.2) 



SUBROUTINE PA FCOM ( V, T ,S , NCF I 

THIS FIND THE DIFFERENCES BETWEEN THE I TH AND JTH ELEMENT 
OF VECTOR V, FCR ^LL lOJ, 1,...,T; PRINTS THE DIFFERENCES 
AND THE SUPPLIED STANDARD DEVIATION S 
V( 17 ),S 

GER NCF,T ,I,J ,TM1,IP1 
E(NOF,101) S 



101 



REAL 
INTE 
WRIT 

F0RMAT(9X, '//PAIRWISE COMPARISONS//*/lX, • IF AND ONLY', 
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OOOOOOOOOOOOOOOOO 



« • 
♦ • 
^ • 
« • 
♦ • 
^ • 
♦ • 



102 

20 

10 



IF THE NULL HYPCTH 
COMPARISCNS HAVE M 
DESIRED CUAMILE 0 
TIMES TFE STANDARD 
WITH THE FOLLOWIN^G 
SUMS OF ROW RANKS: 
I J |SR(I» 

TM1=T-1 
CO 10 1=1, TMl 
I P1=I+1 
DO 20 J = IP1,T 

X = V( I J-V( J) 
A>=ABS(X ) 
WRITE <NQF ,10 
FCPMATI1X,2I 
CONTINUE 
CONT INUE 
RETURN 
END 



ESIS IS REJECTEDVIX, • THESE ' , 
EANING. COMPARE THE*,/ IX, 

F THE T DISTRIBUTION* /IX , 
DEVIATION * ,F10.4,/ IX, 

ABSOLUTE DIFFERENCES IN THE*/1X 
*//lX, 

-SRI J)1 '/ I 



2J I,J,AX 
5,3X,F10.4) 



SUBROUTINE RANQUE (NV ALS , VALS ,SVALS,RANK, Q ,KEY ,NTREAT , 

* NBLCCK,CPTICN) 

DIMENSION VALSINVALS 

#SVAL SIN VALS) , KEY! NVAI 
INTEGER KEY, NEWT IE, 01 

* JJV,NL, JJ,QKEY 
REAL DI F , VALS ,SVALS, RANKl ,R ANK , A V E , E FSI LN ,RANC, Q , RAN GE 



),Q(200i ,RANK(NVALSi ,BANQI400i, 
LS) ,JJV(400) , RANGE (20 0) , QKEY (200 ) 
LOTIE,LL , NTR EAT, NBLOCK, OPTION, 



THESE TWO SUBROUTINES RANK VALUES 



Q 

OIF 

EPSI LN 

NEWTIE 
OLOT IE 
KEY 

RANK 

LL 

AVE 

RANG 



THE RANK 
DIFFERENC 
TOLERANCE 
SAY TWO 
KTH AND K 
K-ITH AND 
ORI GINAL 
GF SNALLE 
RANK CF T 
LOWEST IN 
AVERAGE 0 
IF OPTION 
=3 RANKS 



CF RANGES 
E BETWEEN 



WITHIN BLOCKS 
KTH AND K+ITH 

I.E. 



ARE 

ARE 



CF ROUNDING ERROR, 

VALS ARE TIED 
■HTH SORTED VALUES 
KTH SORTED VALUES 
INDEX OF VALS, E.G., KEY! 
ST VAL, K£Y( NVALS) HAS I 
HE UNSORTEO (ORIGINAL) 
CEX OF TIED SORTED VALU 
F LL C HIGHEST INDEX OF 
=1, TOTAL SET OF RANKS, 
W/I BLOCK PLUS RANKS OF 



SORTED VALUE 
, CLOSE ENOUGH TO 

TIED (LOGICAL INT) 
TIED (LOGICAL INT) 
1) HAS ORIG INDEX 
NDEX CF LARGEST VA 
VALUES 
ES 

TIED SORTED VALUE 
=2 RANKS W/I BLOCK 
RANGES Of BLOCKS 



10 

15 

51 



IF ( CPTICN.GE .2) GO TO 51 

NL=N VALS 

CO 10 1 = 1, NVALS 

S VALS( I ) = VALS (I) 

K£Y( I )=I 
CONT INUE 

CALL RNQ (SVALS, KEY, NL, RANG) 



CO 



15 1 = 1, NVALS 



I ) 



25 



RANK( I )=RANQ( 

CONTINUE 
GO TO 95 
NL=NTREAT 
DO 20 I=1,NBLCCK 
QKEY ( 1 ) = I 
DC 25 J=1,NL 

UJ=I+( J-1) UNBLOCK 
SVALS( J )=VALS( JJ) 

JJV( J)=^J 
KEY( J )=J 
CONTINLc 

CALL BNC ( SVALS, KEY, NL,RANQ) 
RANGE! I ) = SVALS(NL )-SVALS( 1) 
WRITE (9,1C7) (RANC(K) ,K=1,NL ) 
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1 07 FORMAT (IX , 7F10. 3 ) 

DC 30 J=lt^L 

RANK( JJV( J))=RANQ(J) 

30 CCNTIME 

20 CONTINUE 

IF ( aPTICN,£Q.3) CALL RNQ (RANGE, GKEY , NBL OCK , Q J 
95 RETURN 

END _ 

SUBROUTINE RHCER ( X ,N PAI R , RHO , B1 , BO, OPTION) 

C THIS SUBROUTINE TAKES X, A VECTOR COMPOSED OF TWO EQUAL 
C LENGTHED VARIABLES, AND FINOS THE CORRELATION COEFFICIENT 
C RHO (PEARSON'S: C CVAR I ANCE/ STOE V1*ST 0EV2 ) 

BEAL X(40C),RF0,SSA, SSB , FN, A B,DA , 08 , AMEA N ,6ME AN, 
#B1,80,RHC ,CENCM 
INTEGER NFAIR, OPTION 
FN=FLOAT (NFAIR) 

AMEAN=0. C 

BMEAN=0.0 

SSA=0.0 

SSB=0.0 

A8=0 .0 

B1=0,0 

E0=0,0 

CO 10 1 = 1 ,NPA IR 

AMEAN=AMEAN+X( I) 

BMEAN=eHEA^+X( ( I+NPAIR)) 

10 CONTINUE 

AMEAN=AMEAN/FN 
EM£AN=BMEAN/FN 
CO 20 1 = 1 ,NPAIR 
DA=X ( I )-AM£AN 
DB=X( ( I+NPAIR ) »-BPEAN 
AB=AB+(OA«CB) 

SSA=SSA+( DA*DA ) 

ssa=ssE+(oe*OB ) 

20 CONTINUE 

DENOM=( SSA**0.5) *(SSB**0,5) 

RHO=-AB/DENCM 

IF (OPTION. EQ .1) 81=AB/SSB 
IF ( OPTION. EQ.l) B0=AMEAN-B1*BMEAN 
IF ( OPTION. EQ. 2) B1=AB/SSA 
IF ( OPT ICN.EQ .2) 60= BMEAN-B 1*AME AN 
^^RIT£(9 , 122) FHO,cO,Bl 
122 FORMATdX,' R FO, BC ,B i : • , 2F1 0.4) 

RETURN 

END 

SUBROUTINE RN C ( SVAL S ,KEY ,NL ,RAN Q ) 

C THIS SUBROUTINE SORTS AND RANKS A VEOTCR FROM RANQUE 

C OR ANY OTHER SUBROUT INE; SVALS ARE UNSORTEC VALUES V^HICH 
C WHICH ARE RETURNEE SORTED AND DETERMINE RANKS; KEY IS THE 
C VECTOR 1,...,NL WHERE NL IS # ELMS IN SVAL; RANQ IS RANKS 
C OLDT IE,NEWTI E ARE LOGICALS TO START AND END TIED VALUES 
REAL SVALS(NL ),RANQ( NL) , EPSILN, 

♦AVE, DIF 

INTEGER NL,I,LL, I PI, NEW TIE ,0 LDTI E,KEY(NL),K 
EPSI LN=0.0001 
WRITE(9, 222) 

222 F0RMAT(10X,'*<** START RANKING IN RNQ 

OALL SHSOBT( SVALS , KEY ,NL) 

CLOT IE=0 
CO 10 I=1,NL 

RANQ( I )=FLCAT( I) 

10 CONTINUE 

CO 30 1=1 ,NL 
NEWTI E=0 
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K=K£YU ) 

R;1NQ{K >=FLCAT( I) 

C I F I I .GT.i ) WRITE (9,104) I , K , RANG (Ki , PANQ ( I » 

C104 FORMAT (1X,*I,K » , 21 5 , • RANQ(K) *tF10.3, 

C ♦ • RANC( I )• ,F1C.3 ) 

I P1=I+1 

OIF=SVALS( IP1)-SVALS( I ) 

IF ( I.EC.NU 0IF=1.0 
IF ( DIF.LT.EPSILN I NEWTI£=1 
IF ( NEUTIE .EQ.CLO TIE ) GO TO 29 
IF (NETTIE .LT .GLUT IE ) GO TO 15 
LL = I 

GC TO 29 

15 AVE=0 .5*(FLOAT(I+LL) ) 

OC 20 J = LL ,I 

PANQ(K£Y(J))=AVE 
20 CONTIMjE 

29 CLOT IE=NE)<TIE 

C WRITE (9,1C7) {RANG(K) ,K=1,NL ) 

30 CONTINUE 

WRITE (9, 121) 

121 FGRMATdX,' SORTEC VALUES') 

WRITE (9,1C7) (SVALS(K) ,K=ltNL) 

WRITE (9,122) 

122 FORMATdX,' RA^KS CF VALUES •) 

WRITE (9,1C7) (RANQ(K) ,K=1,NL ) 

C WRITE (9,1C8) (KEY (K ) ,K=1 ,NL> 

107 FORMAT (IX , 7F10. 3) 

C108 FORMAT dX , 7110) 

RETURN 

END 



SUBROUTINE RP ARAM ( NI F , NRP , I P , RP ) 

C THIS READS THE INTEGER ANO REAL PARAMETERS OF THE OPTIONS 
C FILE FOR EITHER MSTROR OR CELLER 
INTEGER MP ,NPP, I P(50) 

REAL RP(IC) 

101 F0RMAT(16I5) 

102 FORMAT(3F10.3) 

PEADdOtlCl) MP 

IF (NIP.GE.l) REAC(10,101) ( JP( I ) ,1 = 1 ,NI P ) 

RE AD (10, 101) NRP 

IF (NRP.GE.l) R£AC(10,102) ( RP( I ) , 1= 1 ,NRP ) 

RETURN 

END __ 

SUBROUTINE TH>PA(X,NCF) 

C THIS WRITES THE RESULT FOR A KOLMOGOROV, LILLIEFORS 
C TEST TO THE CLTPUT FILE WHEN THE HYP=1 
WRIT E(NOF , 101 ) X 

101 FORMATdX,' THE NULL HYPOTHESIS: F*( X ) = S ( X) ' / IX , ' THE ' , 

*' TEST STATISTIC IS LARGER OF THE ABSOLUTES VALUES'/ 
*3X,’0F THE TWC D I FFE RENCES: ' /IX , F 10. 4 ) 

RETURN 

END 

SUBROUTINE THYPB(X,NCF) 

C THIS WRITES THE RESULT F CR A KOLMOGOROV, LILLIEFORS 
C TEST TO THE CLTPUT FILE WHEN THE HYP=2 
WRITE(NOF ,101 ) X 

101 FORM AT( IX, 'THE NULL HYPOTHESIS: £=*«( X )<=S ( X) ' / IX , • THE ' , 

♦' TEST STATISTIC IS LARGEST DIFFERENCE '/ 

*3X,'0F F*(>I ) -S( XI) : ' /1X,F10.4) 

RETURN 

END 

SUBROUTINE THVPC(X,NQF) 
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C THIS WRITES Th£ RESULT FCR A KOLMOGOROV, LILLIEFORS 
C TEST TO THE CLTPUT FILE WHEN THE HYP=3 
WRIT£(NGF,101) X 

101 FORMATdX, 'THE NULL HYPOTHESIS: F«( X ) =>S ( X) • / IX, ' THE ' , 

#* TEST STATISTIC IS SMALLEST (OR MOST NEGATIVE}'/ 
*3X,'0F F*(XI )-S( XI): '/1X,F10.4J 
RETURN 
END 



gram CONTAINS THE TEST SUEROUTINES 
T number; tests are SEPARATED BY 



C THIS SECTION CF THE PRO 
CIN THE ORDER OF THEIR TES 
C -/-/-/ 

SUBROUTINE BIRTST(NO 
C THIS MASTER SLBROLTINE F 
C BASED ON THE TCTAL NUMBE 
C THE HYPOTHESIZED FROBABI 
C THE NULL HYPOTHESIS IS E 
C UPPER CONFIDENCE INTERVA 
C PROBABILITY BASED ON THE 
C IF N<20, EXACT BINOMIAL 
C OTHERWISE THE NORMAL APP 
INTEGER NOF, IP( 50) , 

REAL ALPHA, ALPHAT,C 
♦ NKV(21) 

T=RCCT( 1 ) 

N = RCCT(2 )■»! 

PSTAR=RP (1) 

ALPHYP=1 .C-RP (2) 

HYP= IP(1 ) 

WRITE (NCF,201) N ,T, P STAR ,ALPHYP 
201 FORM ATdOX,' B INOMIAL TE ST : ' / IX, • OF ' , 1 5 , ' ELEMENTS ', 

WERE SUCCESSES'/IX, 

P STAR IS:' ,F10.5/ 

GNIFICANCE IS:',F10.5) 

(NOF, 301) 

S IS P=P STAR, ALT IS POP STAR' ) 
(NOF, 302) 



F,kCCT,RP,IP 
INDS ALPHA H 
R N, THE NUM 
LITY PSTAR,A 
VALUATEO. IT 
L BOUNDS (Cl 
NR CF SUCCE 
COMPUTATIONS 
RCXIMATICN W 
RCCT(50) ,N,T 
IUL,CILL ,RP( 



, AL PH AT, 
AT OF A 
BER OF S 
ND FCRM 
THEN FI 
LL,CIUL) 
SES AND 
WILL BE 
ILL EE U 
,HYP 

50) ,PSTA 



CILL , CIUL) 
BINOMIAL TEST 
UCCESSES T, 

CF HYPOTHESIS. 
NOS LCWER AND 
FOR ACTUAL 
FAILURES. 

PERFORMED, 

SED. 

R ,ALPHYP, 



'OF DATA SET', 15,* 
' THE HYPOTHESISED 



301 



IX,' THE LEVEL OF SI 
IF (HYP. EC. 1) WRITE 
FORMATdX, 

* ' THE NULL HY POTHESI 
IF (HYP. EC. 2) WRITE 

302 FORMATdX, 

* ' THE NULL HYFOTHESI 
IF (HYP. EC. 3) WRITE 

303 FORMATdX, 

«'THE NULL HYPOTHESIS 
IF (N.GT.20) GO TO 3 
CALL BINH2C(N,T,P 
CALL CIBINC(N,T,A 
GO TO S9 
30 CONTINUE 

CALL BINOPM(N ,T, PSTA 
99 CONTINUE 

WRITE (9,101) ALPHAT 
WRITE (NOF, 101) ALPH 
101 FORMATdX,* ALPHA HA 
RETURN 
END 

C-/-/- /-/-/-/-/-/-/-/- /-/- 

SUBROUTINE B I NORM 

* ( N ,T,PSTAR ,HYP 

THIS SUBROUTINE FINOS TH 

AND THE CONFIDENCE LIM 
CALLING THE SUBROUTINE 
ALPHYP THE ALPHA SPECIF 
ALPHAT THE ALPHA RETURN 
ON THE NUMBER 
ZHAT THE EMPIRICAL Z 
E(N SUCCESS)=( 
ZHYP Z RETURNED FRCM 



S IS P<=P STAR, ALT IS P>P STAR*) 
(NOF, 303) 

IS P>=P STAR, ALT IS P<P STAR') 

0 

STAR,HYP,ALPHAT ,NKV) 

LPHYP ,NKV,CI LL,CILL) 



R, HYP, ALPHAT , ALP HYP , C ILL , CIUL ) 

, CILL, CIUL 
AT, CILL, CIUL 

T, Cl LOWER, UPPER* ,3F10.3) 






C 

c 

c 

c 

c 

c 

c 

c 

c 



, ALPHAT, ALPH 
E ALPHA HAT 
ITS OF THE S 
NCRMV 

lEO IN TEST 
ED BY THE NO 
OF SUCCESSES 
= STANDARD N 
NOPSTAR) ,OIV 
NCRMV BASED 



YP,C ILL, CIUL ) 

CF A BINOMIAL OIST 
AMPLES* PROBABLITY 

1. 0-LEVEL OF SIGNIF 
RMV SUBROUTINE BASED 
AND TOTAL SA.vPLE 
ORMAL N SUCC MINUS 
ICED BY STD DEV (NPQ ) 
ON ALPHAS" OF FYP 
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INTEGER NtT,H\P 

REAL ZH AT tZfALPH, ALPHA! ,CILL ,CI U L ,NP ST ,S TDEV t ZHYP 
NPST=FLOAT(N) ^PSTAR 

ZHAT = (FLCAT(T)-NPSTJ / ( ( NPST« < 1. 0 -PST AR) ) **0.5 J 
CALL NORMVdtZHATtALPHAT) 

IF (HYP.GT.ll GO TC 20 

IF (ZHAT.GT.O) AL PHA T= ( 1 . 0-AL PHAT ) *2. 0 
IF (ZhAT.LE.OI AL PHAT=ALP HAT»2.0 
GG TO 3C 

20 IF (HYP. EC. 2) AL PHAT = 1. 0-ALPHAT 
3 0 STDEV=( FLCAT( T«( N-T) )/( ( FLOAT (NJ J=>*3 J J*#0.5 
CALL N0R^<V(2t 2HYP tALPHYPi 
TOIVN=FLCAT(T I/FLOAT (N) 

CILL=TOI VR*ZHVP#STDE V 
C1UL=TDI VN-ZHYP*STDE V 
RETURN 

END __ 

SUBROUTINE NC FUZK (N, NKV ) 

C THIS SCUBRCLTINE PRODUCES THE NKV VECTOR OF N CFOOSE K 
C FOR K=0,... THE INDICES OF NKV ARE ONE 

C HIGHER THAN THE COR RES PONDI NG K , E .G . , 1= 1 , . . . ,N + 1 

REAL NUM ,C£N0^,NKV(2i^ 

INTEGER UL,MEC,NP1,I PlfNPlMI ,N, I 
NP1=N+1 

LL=IFIX (N*0. 5) 

PED=IFIX (NPi<0.5) 

NKV( 1)=1 
NKV( NPl ) =1 
NUM=1.0 
DEN0M=1. C 
CO 10 I=1»UL 
IP1=I+1 
NP1MI=NP1-I 
NUM=NUP^FLCAT (NPIMI ) 

D£N0M = C5NGN*FLCAT (I) 

NKV( I Pl)=NLM/DENOy 
I F( I . LT.NEC) NKV( NPIMI )=NKV( I PI) 

10 CONTINUE 

V^RITE (9,101) 

101 FORMAT (!)(,• ^ CHOOSE K VECTOR') 

WRITE (9,1C2) (NKVd ) ,I = 1,NP1) 

102 FORM AT( a F10.3 ) 

RETURN 

END __ __ _ 

SUBROUTINE BI NPMF ( P, N ,NKV tVECLIM , PQV t PMF , CUMPR ) 

C THIS SUBROUTINE RETURNS THE VECTOR PQV CCRR E SPONCING TO 
C P*«K TIMES Q**(N-K) FOR K=0 , . .. , VtCL IM» AND PART OR ALL 
C OF PMF FOR BINCMIAL. LET VECLIM = N TO PRODUCE ENTIRE PMF. 
C FOR LINESEARCF, IT WILL BE LESS. VECTOR NKV FROM NCHUZK 
C CUMPR IS CUMULATIVE PROBABILITY 
INTEGER I , VECLIM ,N,K ,UL 

REAL P»QtPGV(21) ,PV(21) , QV( 2 1 ) , P HOLD t QHOLD, NK V( 21 ) , 
*CUMPR,DI VER,PC,PMF(21 ) 

CUMPR=O.C 
G=1.0-P 
LL=VECLIM+1 
NP1=N+1 
Cl VER=P/C 

WRITE! 9, iC3) P,Q,PQ, DIVER, VECLIM 

103 FORMATdX, *P, C , PQ , 01 VER , * ,4F 10. 7 , • VECLIM • ,I 5 ) 

?Q=Q=«‘«N 

CO 20 1=1, UL 
PCV( I )=PG 
PQ=PQ*CIVEF 
PMF( I)=PQV(I)^NKV(I) 
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20 

101 



CUMPR=CUMPF+PMF(I 
CONT INUE 

l^RITE (9,101i VECLIM 
FORM ( 1>, *VECL IM' - 



> 



* P • ,F8.3 ,• 



102 



CUM 
(PQVa 
i 



, PtCUMPR 
15, 

PR06: *,F8.3, 
) ,I=1,NP1) 



P Q VECTOR: •) 

WRITE ( 9,1C2) 

FORM 4T( 8 F10.3 
RETURN 

END _ _ _ _ 

C iuBROUTIKB B I NH20 <N, T , PSTAR , HYP , ALPHAT.NKV) 

C THIS FIGURES THE ^LPHA HAT VALUE f AND RETURNS THAT 

C VALUE ANC THE N CHOOSE K VECTOR TO THE MAIN SUBROUTINE 
C 6INTST. 

REAL PS TAP, ALPHAT.NKV (2 1 J tPQV(21 ) ,PMF(21 ) ,CUMFR 
INTEGER N,T,HVP, I .TPl.NPl 
CALL NCHLZKIN ,NKV) 

NP1=N+1 

CALL BINFMFIPSTAR ,N, NKV ,N ,PQ V , PM F , CUMPR » 



TP1= T+1 
ALPH1=0 
ALPH2=0 

IF (HYP.EC.3J GO TO 20 
DO 15 I=TP1,NP1 

ALPHl=ALPhl-»-PMF(I) 

15 CONTINUE 

ALPH=ALFH1 

20 IF (HYP. EC. 2) GO TO 30 

DO 25 1=1, TPl 

ALFH2=ALPH2+PMF(I) 

25 CONTINUE 

ALPH=ALFH2 

30 IF(HYP.NE.1» GO TC 99 

I F(ALPF.GT .ALPhl ) ALPH=ALPH1 
A LPH = ALFH*2.0 
99 WRITE(9,1C1) iSLPH 

101 F0RM4T(1X, • VALUE OF ALPHA HAT:',F10.3) 

ALPHAT=ALPH 

RETURN 

END __ __ 

SUBROUTINE CIEINO(N, T ,ALPHYP ,NKV ,CILL ,CI UU 
C THIS USES A BISECTION SEARCH TC FIND THE UPPER AND 

C LOWER LIMITS (CILL,CIUU TO THE PROBABILITY CF SUCCESS 
C GIVEN THE NUMEER CF SUCCESSES T AND TOTAL SAMPLE SIZE N. 
C NKV FROM NCHL2K, 

C THIS USES SYMMETRY CF BINOMIAL, E.G., A 01 ST WITH 
C 8 SUCCESSES OF 10 WILL HAVE THE LIMITS OF PROBABILITIES 
C 1.0 - LIMITS CF A OIST WITH 2 SUCC OF 10 ( UP P £R<=>L0WER1 
INTEGER N,T,TT,I,J,TTM1,LC0UNT 

REAL ALPHYP,NKV(2i), CILL,CIUL,MLEP,LALPH,UALPF 

UALPH=ALPHYP#C.5 

LALPH=1.0-UALFH 

NDIV2=IFIX(FLCAT(N+1 ) #0 . 5 * 

TT=T 

IF(T.GT.NCIV2 ) TT=N-T 
TTMl =TT-1 

«LEP=FLOAT(TT )/FLOAT (N) 

PLL=0.0 

PUL=C.O 

IF (TTMl.Gc.OJ PLL=BISEX(N,TTM1,NKV,LALPH) 

IF (TT.LE.N) FUL=eiS£X(N,TT,NKV,UALPH) 

IF(T.LE.NCIV2) GO TO 10 
C ILL=1 .0-PLL 
CIUL=1.C-PLL 
GO TO 20 
10 CILL=PLL 
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i 



c:ijl=pul 

IF { CILL.lt. 0.0) CILL=0.0 
IF ( CIUL.GT.1.0) CIUL=1.0 
20 WRITE (9,1011 CILLfCIUL 

101 FORMAKIX,' PERCENT LOWER LI MIT • , FI 0. 3 , 

=i' • UPPER LIMITSF10.3) 

RETURN 

END _ __ 

FUNCTION EISEX(N,T,NKV,ALPHI 
C THIS DOES A RAY SEARCH TO FIND THE EXACT P FCR WHICH 
C THERE IS A l.C-ALFHA/2 OR ALPHA/2 PROS OF T SUCCESSES IN 
C N TRIALS. NKV IS VECTOR CF N CHOOSE K,K=0,...,N 

REAL NKV (21) , ALPH,LO ,MEO,HI , EPSI LN,CUMPR ,PQV( 21) , 

♦ GAP, P,EPSM, PNF( 21 ) 

INTEGER N ,T,LCOUNT,LLIM 
£PSILN=O.C02 
SPSM1=1.0-£PSILN 
LC0UNT=0 

LLIM=64 

IFCALPH.LT.O. 5) GC TC 10 
L0=0. 0 

H I=FLCAT(T )/FLOAT (N) 

GO TO 2C 

10 LO=FLOAT IT )/FLQAT(N) 

HI=1 .0 

20 NEO=0.5«(LC+HI) 

WRITE(9,2C1) LO, MED, HI, LCOUNT 

IF ( (M£D.LE.£FSILN).CR. (MED.GE.EPSMl) ) GC TO 95 
201 FORMATdX,' LC,MED,HI, LOOP COUN T • , 3F 10. 5 ,I 5 ) 
LCCUNT=LCCUNT-*1 

CALL BINPMF(MED,N,NKV,T ,PQV,PMF,CUMPR) 

GAP=CUMPR-ALPH 

IF((ABS(GAP).LT.EPSI LN).CR. (LCOUNT. GT.LLIM) ) GO TO 95 
IF(GAP. LT.Q.O ) GO TO 30 
LC=MEC 
GO TO 2C 
30 HI=MEO 

GO TC 20 

95 WRITE (9,1C1) ALPH,MEO 

101 FORMATdX,' ALPHA: ' , F10.3,' Cl L IMI T : * , F 10. 3 ) 
eiS£X=MEC 
RETURN 
END 

C-/-/- 

SUBROUTINE QU ANT L ( NOF ,RCCT, R P,l P , ALPH AT , C ILL , ClUL , 

♦ LIST,LPTR,T1EV,TYPCEL) 

C THIS FINDS THE PROBAEILITY THAT A SPECIFIED X ,XSTAR 

C IS THE SPECIFIED CUANTILE POINT QSTAR IN A OISTRISUTION 
C REPRESENTED EY THE VALS GF SAMPLE. THE CONFIDENCE 
C INTERVALS (WHICH ENCLOSE POPULATION QSTAR) ARE GIVEN 

INTEGER NCF,RCCT(50) , IP ( 50) , LPTR ( 17 ) ,TIEV(2) , T1 , T2 , 

♦ T2P2,T1F1,N,NP1,RHAT,SHAT,KEY(400 ),TYPC£L,HYP 
REAL RP ( 50 ,ALPHAT,C ILL,CIUL ,LIST(400) ,NK V( 2 1 ) , PSTAR , 

♦ PMF (21 ) , ALPhl,ALPH2 ,ALFMIN, 2HA T 1 ,ZHAT2, PQV ( 2 1) , 

♦ CUMPR,ZSTAR,NP,CCNFCC, LTAI L ,UTA IL , ZRHAT , ZSHAT,ALPHYP 
WRIT£(N0F,523) 

523 FORMATdCX, •♦♦QUANTILE TEST**' ) 

T1=PCCT( 1)-TI£V( 1 ) 

T2=RCCT( 1 )+TI£V( 2) 

HYP= IP(1 ) 

PSTAR=RP (1) 

AL PHYP=1 .0-RP (2) 

HALFA=0 .5*ALPHYP 
N=RCCT( 2)4RCCT(1) 

WRIT E(NOF, 334 ) T1,T2 ,N 

334 FORMATdX,' NUMBER OF DATA ELEMENTS LESS THAN X*',I5/ 
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102 

103 

10 

15 

104 

30 

I 08 

40 

174 

5 

101 

95 

107 



«1X,' NUM6EP 0 

♦ 15/lXf* TOTAL 
^Pl=N+l 

IF (N.GT.2C) 
CALL NChUZ 
CALL BINPM 
CALL CI20Q 
WRITE (9,1 
FORMAT ( IX , 
WRITE (9,10 
FCRMATdX, 
T2P2=T2+2 
T 1P1 = T1 + 1 
ALPHA1=0.0 
ALPHA2=C,0 
00 10 1 = 1 , 
ALPHA 
CONTINLE 
00 15 I=T2 
ALPHA 
CONTINUE 
WRITE(9,10 
FORMATdX, 
GO TO 40 
ETDE V=(FLCAT( 
NP=FLCAT(N 
ZHATi=(Tl- 
CALL NCRMV 
ZHAT2=(T2- 
CALL NCRMV 
ALPH2=1.0- 
CALL NCRMV 
RHAT=l-»( IF 
SHAT=1-KIF 
ZRHAT=(RHA 
CALL NCRMV 
ZSHAT=(SHA 
CALL NCRMV 
UTAI L=1.0- 
C0NFCC=1.0 
W RITE (9,10 
FORMATdX, 
ALFM IN=ALPFA1 
IF ( ALPHAl.GT 
WRITE (9,17 
FORMAT (IX, 
IF (HYP. EC. 1) 
IF (HYP.EC.2J 
IF (HYP.EC.3J 
DO 5 1=1 ,N 

K£Y( I)= 
CONTINUE 
IF ( TYPCEL.EQ 
CALL SHSCPT ( 
WRITE (9,1C3> 
CILL=LIST (RHA 
IF(SHAT. GT.N) 
C:UL=LIST('HA 
WRITE (9,10U 
WRITE (NGF,10 
FORMAT! IX, *AL 

* 'INDEX OF LOW 
GO TO 99 
WRITE (9,107J 
WRITE (NCF,iO 

X , • AL 
INCICES OF 



F ELEMENTS 
NUMBER OF 



LESS THAN 
ELEMENTS • 



OR EQUAL 
,15) 



TO X=i‘' 



GO TO 3 0 
K(N ,NKV J 

F(P$TAR ,N,NKV,N,PQV ,PMF,CUMPR) 
(ALPHYP,N,PMF,RHAT , SHAT , CON FCO ) 
C2) 11, T2,RHAT, SHAT 
• T1,T2 ,RHAT,SHAT* ,415, • PMF:' I 
2) (PMF ( I) ,I=i,NPl) 

10F6.3) 



TIPI 

1=ALFHA1+PMF(I ) 

F2, NPl 

2=ALFHA2+PMF(I ) 

A) ALPHAl, ALPHA2 

• ALPHAl, ALPHA2:' ,2F7. 3) 

M*PSTAR*( l.O-PSTAR ) )*=«‘0.5 

)=i'PSTAR 

NPJ/STOEV 

(1, ZHAT 1 ,ALPH1 ) 

NP) /STDEV 

(1, ZhAT2,ALPH2 ) 

ALPH2 

(2, ZSTAR,HALFA ) 

IX( NP+ZSTAR«STO£V-EPSILN)) 
IX(NP-ZSTAR*STDEV“EPSILN) ) 
T-NP ) /STDEV 
(1, Z-RHAT ,LTAIL ) 

T-NP)/STOEV 
(1, ZSHAT,UTAIL ) 

LTAI L 

-LTAIL-UTAIL 
£) CCNFCO, LTAIL ,UTA IL 

• CONFCC , LTAIL, UTAI L' ,3F7. 3 J 



.ALPHA2) ALFMIN=ALPHA2 
A) ALPHAl, ALPHA2,ALFMIN 
• ALPHAl, ALPHA2, ALFMiN: 
ALPHAT = 2.0=*ALFMIN 
ALPhAT=ALPH2 
ALPhAT =ALPF1 



I 



, 2F7.3) 



.5) GO TO 95 
LIST ,KEY,N ) 

(LIST! I ),I=1,N) 

7 ) 

SHAT=N 

1 ) 

ALPHAT , Cl LL, Cl UL 

1)ALFHAT,LIST(RHAT ) , LI ST ( SHA T ) ,RHAT ,SHAT 
PHA HAT, LOWER AND UPPER BOUNDS ', 2F9 . 3/1 X , 
ER LIMIT', 15,', INDEX OF UPPER LIMIT', 15) 

ALPHAT ,RHAT, SHAT 
7) ALPHAT.RHAT, SHAT 
PHA HAT',F7.3, 

LOWER AND UPPER BOUNDS ’,21 5) 
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99 

109 



l^RITE (9,139) C 
V^RITE (NCR, 109) 



FORMAK 

RETURN 

END 



CONFCO 
CCNFCC 



IX, 'INTERVAL CONFIDENCE C QEFF I Cl E NT: ' , FIO . 3 J 



HAT, ShAT, CONFCO) 

AND SHAT THAT WILL 
ICIENT OF AT LEAST 



1,1 ,IH1,J,JP1 

PFYP ,HALFA, B£ST,CRIT , 



SUBROUTINE CI20Q ( AL PHY P , N, P MF , R 
C THIS SUBROUTINE FINDS INDICES RHAT 
C INCLUDE WILL GIVE A CONFIDENCE COEFF 
C l.O-ALPHA, USED FCR QUANTILE TEST, 

C RHAT IS INDEX CF LOWER LIMIT, SHAT IS INDEX OF UPPER 
C BOUND. SINCE INDICES OF PMF ARE 1 
C TO N+l, S=SHAT, R=RHAT 

INTEGER RFAT, SHAT,RMAX, SMIN,N,NP 
REAL PMF (21) , CDF(23) ,RCDF(23 ),AL 
* CR ITl ,CRIT2 ,CONFCO 
NP1=N+1 

HALFA=0 ,5=»ALPFYP 
1=1 

CCF( 1 )=PNF (1 ) 

10 CONTINUE 
I M1=I 

IF ( CCF( I ) ,GE .HALFA) GO TO 15 
I =1+1 

CDF( I )=CDF(IM1)+PMF( I ) 

WRITE(9,191) I,PMF(I ),CDF(I) 



191 FCRMATdX, *I,PMF( IMl ) ,CDF (I)' ,I5,2F10,4) 

IF ( CDF( I ).LE .HALFA) GO TO 10 

15 RHAT=IM1 
J=NPi+l 
RCDF(J)=0 
20 CONTINUE 
JP1=J 
J=J-1 

RCDF( J ) =RC CF ( J FI) +PMF(J) 

WRITE(9,192) J ,RCOF( J) ,PMF(J) 

192 F CRM AT (IX, *J,RCDF ( J ‘ , PMF ( J ) • , 1 5 , 2F 10 . 4 ) 

IF ( RCDF(J).LE. HALFA ) GO TO 20 

SHAT=J 

C0NFC0=1 .0-CCF(RHAT)-RC0F(SHAT) 

WRITE (9,101) CDF (RHAT) ,RCDF (SHAT) , CCNFCC, RHAT, SHAT 
101 F0RMAT(1X,'L0 , UP, CONFCO, INDICES, Cl:*, /IX, 

* 3Fi0.4,2I5) 

RETURN 

END 

C-/-/-/-/-/-/-/-/-/ 

SUBROUTINE COXSTU (L 1ST ,LPTR ,IP , 

C THIS SUBROUTINE DIVIDES THE ONE 

C PERFORMS THE SIGN TEST TC TEST THE T 
C PAIRING THE FIRST HALF DATA ELEMENTS 
C THERE ARE TWO OPTIONS: 1=0N£ VARIABL 
C 2=C0RRELATI CN, RANKS OF THE INDEPEND 
C ORDERING THE DEPENDENT VARIABLE 

INTEGER LFTR( 17) , IP( 50) ,NOF, TLPT 

* HYP, I NCEPC ,OEPC,KEY( 200) ,NACT 
REAL LIST(^00 ),RP(50 ),TLIST( 400) 

* INDEP(2C0) 

WRITE(NOF,523) 

WRIT E(9, 523) 

523 FORMATd CX,'**COX AND STUART TES 
IF ( IP( 2 ).EQ, 1) GO TO 7 
N = LPTR( 2 )-l 
IN0EPC=0 
CEPC=0 

IFd P(3 ) .LT.IF(4) ) INCEPC=N 
IF(IP(4) .LT. IF(3) ) DEPC = N 
CO 5 1 = 1 ,N 



/- 

RP,NCF,NTEST) 

VARIABLE IN TWO, THEN 
REND AS A MATTER OF 
WITH THE SECCND HALF 
E, IN PRESENT ORDER, 
ENT VARIABLE 

R(3) ,NHALF,MEC,NHT2, 

UL 

,SLIST(4CC) ,D£P(200) , 



T FOR TREND***) 
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5 

918 

6 

7 

8 
9 



10 

20 

30 

40 

111 

112 

113 

50 

C-/- 



5 23 



111 

112 

113 



KEY( I ) = I 

IN0EP{ I) = LIST( I + INOEPC) 

OEP( I ) = LIST(H-DEPC) 

CONT INUE 

CALL SHSCPK UDEP,KEY ,NJ 
CO 6 1=1, N 

SLIST ( I) = OEP(KEY( H) 

WftITE<9,91£) I ,KEY(I ) ,SLI ST( I J 
FORMAT (IX, 'IN COX STU : I ,KE Y , SL I ST : • ,21 5, FI 0.3 J 
CONTINUE 
GO TO 9 
CO 8 1 = 1 

SLIST( n=LIST( I> 

CONT INUE 

NHALF=IF IX(0.5=«'( LPTR (2)-in 

NHT2=2*NhALF 

TLPTRd )=1 

TLPTR(2 ) =1 + NH/LF 

TLPTR(3 ) =1+NHT2 

NACTUL=LPTR(2 )-l 

HYP= IP( 1 ) 

CO 10 I=1,NHALF 

TLIST( I l=SLIST (I) 

CONT INUE 

IF ( NACTUL.GT .NHT2) GO TO 30 
DO 20 I = NH;aLF .NACTUL 
TLIST( 1 ) = SLIST( I) 

CONT INUE 
GO TO 50 
LL=2+NHALF 
DO 40 I = LL, NACTUL 
I M 1= I - 1 

TLIST( IM1)=SLIST( I I 
CONTINUE 

IF(HYP.EC.l) t«RITE ( N0F,11H 

FORMATdX,* NLLL FYPQTHESIS: THERE iS NO TREND •) 

IF (HYP. EC. 2) WRITE (N0F,112) 

FORM AT( IX, 'NULL HYPOTHESIS: THERE IS NO UPWARD TREND • ) 
IF (HYP. EC. 3) WRITE (N0F,113) 

FORMAT( 1 > , 

* 'NULL hVPOTFESiS: THERE IS NO DOWNWARD TREND* i 
CALL SGNT(TLI ST, T LPT R ,NOF ,NHT2, R P , IP , NTEST) 

RETURN 

END 

/ 

SUBROUTINE SGNT( L I ST , LPTR ,NOF ,T OTELM , BP , IF, NTEST) 

REAL A(2 0C J ,B (200 ) ,R P (50 ) , L I ST( 4 00 ) , P , EP S ILN , ADI F ,01 F , 
*NKV( 21) ,ALPHAT,ALFHYP,CIUL,CILL, ALPHAl, ALPHA2 
INTEGER LPTR ( 17) ,I,J ,NE LMV, I E ,K , M , NPLUS , Nf'I NUS ,NTOT , 
*HYP, IP(50) , NTEST, NOF 

NELMV IS THE hUMBER CF ELEMENTS IN VECTORS A AND B 

NPLUS=0 

NMINUS=0 

IF (NTEST. £Q. 5) WRIT E (NOF ,523) 

FORM AT(10X,**^SIGN TEST*^*) 

N£LMV=LPTF(2I -1 
EPSI LN=0.CC01 
HYP=IP(1 ) 

IF (HYP. EC. 1) WRITE (NOF, 111) 

IF ( HYP. EC. 1 ) WRITE (9, 111) 

FGBMATdX,' HC: P ( +) =P ( - ) */ 1 X, • HI: P( + lOP(-)') 

IF (HYP. EC. 2) WRITE (NOF, 112) 

FORMATdX,' HC: P ( +)<=P (- ) * / IX, • HI: P(+)>P(-)*) 

IF (HYP. EC. 3) WRITE (NOF, 113) 

FORMATdX,' HC: P ( +) >=P (- ) • / IX, * HI: P(+XP(-)*) 
ALPHYP=1 .C-RP (1) 

CO 14 1=1, NELMV 
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987 
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102 



104 

105 
99 
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10 

30 

32 

35 

103 

40 



101 



J=LPTB(2J-1+I , 

DIF=L IST( I )-LlST( J ) 

WRITE ( 9,917) I , J. LIST ( I) , LIST (J) ,0IF 

FORMATdX , 'IN SIG N :I , J ,L U-J ) ,01 F : • ,2 I 5, 3 F 10. 3 ) 

ADIF=AeS(OJF) 

IF(ADIF.LE.EPSILN) GO TO 14 
IF (OIF.LT.O) NPLUS=NPLUS+1 
IF (CIF.GT.O) NMINUS=NMINUS+1 
CONTINUE 

WRITE(9,9€7) NPLUS,NMINUS 
WRITE(N0F,986 ) NPLUS,NMINUS 
FORMAT! IX,' SIGN TEST +,-',2I5) 

FORMATdX,' NLMBER OF { + ):*, 15, » NUMBER OF (-):', 15) 
NTOT=NPLLS+NMINUS 

IF (NT0T.LE.2C) CALL NCHUZK ( NTOT ,NKV ) 

IF (NT0T.LS.2C) CALL BINH20 

♦ (NTCT,N PLUS ,0.5 ,HYP , ALP hAT , NKV ) 

IF (NT0T.GT.2C) CALL 

♦ eiNOBM! NTCT,NPLUS ,0. 5 ,HYF,ALPHAT ,ALPHYP, C ILL , CIUL) 
ALPHA2=1.C-ALFHA1 

4LFM IN=ALFFA1 

IF ( ALPHAl.GT .ALPHA2 ) ALFMI N=ALP HA2 
IF (HYP. EC. 1) ALPHAT = 2. 0=>ALFMIN 
IF (HYP. EC. 2) ALPHAT=ALPH2 
IF (HYP. EC. 3) ALPhAT=ALPHl 

WRITE(NOF,102) NE LMV , NTOT ,NP LUS , ALPH AT ,ALFHYP 
FORMATdX,' NLMBER OF PAIRS:', 15,' N CF ', 

♦ 'PAIRED VALUES NOT TIED' ,15 ,/• NUMBER OF FLUSSES',15, 
♦/' ALPHA FAT: ',F10.4/' HYPOTHESISED ALPHA ',Fi0.4) 

IF (ALPHAT.lt. /LPHYP) WR IT £( NCF, 1 04 ) 

IF ( ALPHA T.GE. ALPH YP) WR I TE( NOF , 1 05 ) 

FORMATdX,' REJECT TFE NULL HYPOTHESIS ') 

FORMATdX,' DC NOT REJECT THE NULL HYPOTHESIS') 

RETU RN 
END 

SUBROUTINE MANWHT ( LP TR, LI ST , NOF . RP ) 

INTEGER Ni,N2,NT ,NHOLO, I,J,LPTR( 17), lER 

REAL A( 400 , LIST (400 ) ,RPd) ,U,Z,R(400) ,ALPH,ALPHA7 

WRITE(NOF,523 ) 

FORMATdOX ,'*=>MANN WHITNEY TEST*«*) 

N1=LPTR( 2)-LPTR( 1) 

N2=LPTR(3)-LP7R(2) 

NT=N1+N2 

IF(N1.LE.N2)GC TO 30 
DC 10 1=1, FT 

J=LPTR(3 ) -I 
A( I ) = LIST(J) 

CCNTINLE 
NH0LD=N2 
N2=N1 
N1=NHCLC 
GO TO 35 
CO 32 1=1, NT 
Ad) =LIST( I) 

CONT INUE 

CALL UTEST ( A ,R, N1 ,N 2 ,U , Z , I ER ) 

IFdER. L7.1) GO TC 40 
WRITE (NOF, 103) 

FORMATdX,' ALL VALUES OF ONE SUBGROUP ARE TIED*) 

IF (N2. L7.20 ) GO TO 95 
CALL NORMV( 1 ,Z, ALPHAT) 

WRITE (NGF,101) U,Z, ALPHAT 
FORMATdX,* TEST STATISTIC U:',F10.4, 

♦ 'STANDARCIZED Z :*,Fi0.4/lX, 

♦ ' ALPHA HAT:' ,F10.4) 

ALPH = 1.0-RP( 1 I 
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IF (A LPHAT.lt, ALPHJ W R IT £ ( NOF , 104 J 
IF(ALPHAT.GE. ALPH» W R IT E (NO F » 105 ) 

104 FORMATdX,' REJECT THE NOLL HYPOTHESIS ') 

105 FORMAT! IX,* DC NOT REJECT THE NULL HYPOTHESIS'J 



95 

401 

99 



523 



GC TC 99 

WRITE(401) U,M,N2 

FORMATdX,* TEST STA T IS TIC U : * , F 10. 3 / IX, 

♦ • WITH N1 = *,IA,', AND N2=*,I4I 
RETURN 
END 

c- 

SUBROUTINE MC NEMR ( RC CT, RP ,I P ,NOF ) 

C THIS SUBRCLTINE FIGURES THE VALUES OF THE MCNEMAR TEST 
C IF N<21, IT USES IHE EXACT VALUES OF THE BINCMAL,ELSE 
C USES THE LINEAR INTERPOLATION OF CHI-SQUARE DF=1 
INTEGER RCCT( 4* , I F ( 5 0 ) , B , C, N , T2 , NKV(2U 
REAL RP( 5CdALPHAT,ALPHYP ,tl 
WRITE (NCF,523) 

FORMATdCX ,*=<'*MCNEMAR TEST*«*» 

E=RCCT( 2 ) 

C=RCCT(3 > 

ALPHYP=1.C-RP (1) 

T2=B 
N = B+C 

IF ( N.GT. 20) GO TC 30 

CALL e INH20(N,B, 0.5, 1,ALPHAT ,NKV > 

ALPHAT = C.5=»ALPHAT 
WRITE (NOF,103) 3 ,ALPHYP , ALPH AT 
FORMATdX, * T2 * , 15, 

♦ * HYPOTHESISED ALPHA, ALPHA HAT *,2F7.3) 

GO TO 40 

Tl = < FLOAT ((B-C)=i'*2))/FLCAT(B+C) 

CALL CHICNE (U,ALPHAT) 

WRITE (N0F,104) T 1 , ALPHYP , ALP HAT 
FORMATdX,' T1 * , F9. 3 , 

* HYPOTHESISED ALPHA, ALPHA HAT *,2F7.2) 

IF ( ALPHAT.LE .ALPHYP ) WRITE (NOF,101) 

FORMATdX,* REJECT THE NULL HYPOTHESIS') 

IF ( ALPHAT.GT .ALPHYP ) WRITE (N0F,102) 

FORMATdX,* DC NOT REJECT THE NULL HYPOTHESIS') 

WRITE (NCF,111) (RCCTd ) ,1=1 ,4) 

FORMATdX,* MCNEMAR TABLE :*/ 25X , 'AFTER: */ IX, * BEF ORE : * 
*12X, 'SUCCESS FAILURE* /IX, * SUCCESS* ,3X,2ilO/lX, 

♦ •FAILURE * ,2X, 2110) 

RETURN 

END 

C 

SUBROUTINE CHIONE (X,ALPHAT) 

C THIS SLBROLTINE FINDS AN APPROXIMATION FOR 

C CHI-SQUARE DF=1 BY LINEAR INTERPOLATION 
REAL CHI (8),ALPHAT,P (8) 

INTEGER N 
CHK 1 )=0 
CHK 2) = 1 .223 
CHK 3)=2.706 
CHK 4) = 3.841 
CHK 5) = 5.C24 
CHK 6) = 6. 615 
CHI(7)=7.c79 
CHK 8) = 1 C.83 
Pd) =0 
P(2) =0.75 
P(3) =0. 9 
F(4) =0.95 
P(5) =0. 975 
P(6) =0. 99 
P(7) =0.995 
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30 



104 

40 

101 

102 

111 
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P(8) =0,99<; 

10 CONTINUE 
1 = 1+1 

IF ( (X.GT.Chl (I) ) .AND.( I.LT.an GO TO 10 
IF ( I.LT.8 ) GC TO 20 
PHAT=C.999 
GO TO 95 

2 0 IM 1=1-1 

RATI 0=( X-Chl ( IMl n/(ChI (D-CHK IMl) J 
PHAT=P( IMi)+RATIO*(P U)-P(IMm 
95 alphat=i.c-ph;t 
RETURN 
END 

c-/-/-/-/- /- 

SUBROUTINE FRIEDM i N TREAT , LP TR, L 1ST » I P ,R P ,NOF ,TOT£LM I 
C THIS DOES EITHER THE FRIEDMAN AND DURBIN TESTS; 

C IP(1) =0, FRIEDMAN, =1, DURBIN; IF DURBIN, RP (2) =FILLER, 

C IP(2)=K=J# TREATS PER BLOCK; IPl3)=R=iS^ BLCCKS PER TREAT 
C FOR BOTH, RP(1)=CCNF COEF. RANQUE RANKS THE VALUES 
C BLOCK BY BLOCK; P/RCCP CCMPARES THE SUMS OF RANKS OF 
C DIFFERENT TREATMENTS TO TEST FOR PAIRWISE DIFFERENCE 
INTEGER NCF,NTREAT,LPTR( 17) , K ,B , I , J , R I , TO TE LM , 
*DF1,DF2,K£Y(4C0) , NBL CCK , I NT EST ( 4 00 ) ,T,R, IP(50I 
REAL LIST(400),T2,SVALS(400) ,RK(4C0J ,82,A2,RJ, 

*EPSI LN, 8K25 ,BM, 0EN0M,NUM,MP1,SR JS,NUNVAR,OENVAR, 

♦ STDE V,SRJ(17) ,TERM1, TERM2,RP(50) ,Q(17) 

CO 3 1=1, TOTE LM 

INTEST(I)=1 

3 CONTINUE 
MPl=-999999.9 

IF ( IP( 1 ) .EQ.l) GC TC 5 
WRITEIN0F,524) 

524 FORMAT! ICX, '«=»FRIE0MAN TEST***) 

GO T C 8 

5 WRITE(NOF ,625 ) 

625 FORMATdOX, **=»DURBIN TEST***) 

CO 6 1 = 1 ,TCTELM 

IF ( L IST( I ).GT.MP 1) MP1=LIST(I) 

6 CONTINUE 
MP1=MP1+1.C 
R=IP (3) 

K=IP (2) 

DO 7 I=1,TCTELM 

IF ! L IST( I ).NE.RP (2) ) GO TO 7 
L IS7! I )=MP1 
INTEST <I) =0 

7 CONTINUE 
)<RITE!9,919) FP(2),MP1 

919 FORMATdX,' INTEST VALUES, DURBIN; RP ( 2) , MPi * ,2FiO. 3 ) 
WRIT5(9,813) (INTEST(I) ,I=1,T0T£LM) 

816 FORMAT! IX, 1215) 

8 EPSILN=0.C01 
NBL0CK=LPTR!2 )-l 
CALL RANGUE 

* !TOT£LM,LIST,SVALS ,RK,Q,KEY,NTREAT,NBL0CK,2) 
WR1TE!9,315) !RK ! I ) , I =1 , TOT ELM ) 

315 F0RMAT!1X,6F1C.3) 

SRJS=0.0 
A2=0.0 
E=LPTR!2 )-l 
T=NTREAT 
CO 10 J=1,T 
RJ=0 

DC 15 1 = 1, E 

RI=I+LPTR! J) -1 

IF! INTcST !RI ) .EQ.O) GO TO 15 
RJ=RJ+RK! RI) 



312 

15 



10 



316 



112 



50 



101 

102 



99 



C-/-/- 



c 

c 

c 

c 



THI 
ELEM 
BLOC 
OF S 
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A2 = A2-MRK(RI 
k«RITE (9,212 
FCRMATdX I 
CONTI ME 
SRJ( J ) = RJ 
SRJS=SPJS+ (RJ«PJ» 
CONT INUE 

IF ( IP(1 ).EC.l) GC T 
K = T 

B2=SRJS/ ( FLOAKB I ) 
CEN0M=A2-E2 
URITE(9,216I A2,B2,0 
ktRITE(N0F,316 I A2,B2 
F0RMAT(1X,*A2 ,B2 ,DEN 
IF ( OENOM.LE. EPS I LN) 
EMl=FLOAT(g-l I 
BK25=0.25=MFLCAT( E*K 
NUM= EMI* ( E2-BK25 J 
T2=NUM/0£^CM 
0F1=K-1 

CF2= (E-1 )*(K-1) 

URITE (NCr,112) T2,0 
WRITE (9,1121 T2,CF1 
FORHATdX, 'VALUE OF 
NUMVAR=2,C*B* (Al-Bll 
CENVAR=(K-iJ*(B-l) 
STDEV=( NUMVAR/OENVAR 
CALL PARCCM( SPJ, T ,ST 
GO TO 99 
CONT INUE 
DF1=12*( T-1) 
CF2=(R*T*(K-1 )*(K + 1) 
TERM1=SRJS*FLCAT (DFl 
OF 1= T-1 

CF2=3*R*CF1*( 8+1 )/(K 
T£RM2=FLCAT(0F2I 
T3=T ERM1-7ERM2 
WRITE(9,101» 7ERM1,T 
FORMAT! IX, *TEPM1-2,S 
WRITE(N0F,102 ) T3,DF 
FORMATdX,' TEST STA 

• TO BE CCMPAPED WIT 

• WI TH DEGPEES OF FR 
CF2=B*K*Cf 1 

NUMV AR=FLCAT( (R*(K+1 
NUMVAR=NU^VAR*(FLCAT 
CF2=6*0F 1*(6*8-T-B+1 
CENVAR=FLCAT( CF2 ) 
STDE V=( NLPVAR/DENVAR 
CALL PARCCM( SPJ, T ,ST 
CONT INUE 
RETURN 
END 

/-/-/-/-/-/-/-/-/-/- 
SUBROUTINE QUAOE (NT 
S PERFORMS THE QUAOE 
ENTS BLOCK BY BLOCK 
K ELEMENTS (INTO 0). 
UMS OF TREATMENT RAN 
INTEGER NCF,N1REAT,L 
DF1,0F2,KEY (400 ) ,N 
PEAL LIST(TOTELM) ,T2 
EPSILN,BK25,EM1,Q(2 
OENCM, OENVAR,NUMVAR 
WRITE(NOF ,524 ) 
FORMAT! 10X-, '**QUAOE 
K=NTPEAT 



J**2) 

) I,J,RI, R J , A 2 

J,RI* ,3I5,'RJ,A2',2F1C.3J 



0 50 



ENOM 

,DENOM 

CM* ,3F10.3) 

GO TO 99 

*( (K+ll **2I I » 



F1,0F2 

,0F2 

T2:',F10.3,' OF 1 



*, 15, • 



OF 2 I ' , I 5 ) 



J**0.5 
OEV ,NOF) 



) 

l/FLCAT( 0F2J 
- 1 ) 

ERM2,SRJS,T3 
RJS,T3* ,4FiO,4) 

1 

TISTIC T3* ,F10.4,/iX , 
h CHISQUARED 01 STRIB UTI ON •/ IX , 
EEDOM=* , 15) 

)*(K-1) ) ) 

(CF2)-T3*FL0AT(T ) ) 

) 

)**0.5 
OEV,NOF ) 



REAT,LPTR,LI ST, NCF, T CTELM ) 

TEST. RANQUE RANKS THE OATA 
AND RANKS THE RANGES OF THE 

PARCOM HAS PAIRWISE COMPARISON 
KS, TO TEST FOR EQUAL EFFECT 
PTR(17) ,K,B,I,J,RI,K,E,TCTELM, 
BLOCK 

,SVALS(400) , R(400),B2,A2,RJ,NUM, 
00) ,A1,B1,S( 400) ,SIJ ,SJ2,SJ,MEAN , 
,STOEV,SRJ( 17) 

TEST**' ) 
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E=LPTR(2 )-i 
A1=0 
81=0 
SJ2=0 
71=0 
CF1=K-1 
CF2=DF1-* (E-l» 

EPSI LN=0,CCi 
MeAN=0.5«(FL0AT( ( K+1 JJ » 

CALL RANCLE(TCTELP,LIST,SVALS,R, QtK£Y,K» 8,3J 
DO 10 J=ltK 
SJ=0 

DO 15 1=1,8 

RI =I+LP7R( J J-1 

SIJ = G( I )*{ R(RI )-MEAN) 

A1=A1+SIJ*SIJ 
SJ = £J + S IJ 

WRITE(9, 101 ) I, J,RI,R(RI),Q(IJ ,SIJ, A1 

101 FORMATUX,' I , J ,RI ' ,31 3 ,* R ,Q( IJ ,SIJ ,A1 • ,4F9.3 ) 

15 CONTINUE 

SRJ( J) =SJ 
SJ2=SJ2+SJ*S0 
10 CONTINUE 

81=5 J2/( FLCATIBJ J 
CIF= Al-Bl 

IF ( DIF. LT.EP5ILN J GO TO 95 
Tl=( FL0AT<8-1 JJ^Bl/0 IF 
95 WRITE (9,102) A1 , Bl, Tl, CFl, 0F2 

WRITE (NCF,102) A1 ,B1 ,T1 ,DF1 ,0F2 

102 FORM AT( IX, ‘A1 *,F1C.3 , • 31*,F10.3/ 

=!« IX,' TEST STATISTIC T1:*,F10.3,* OFl',15,' 0F2',I5) 
NUMVAR=2.C*( Al-81 )«P LOAT(b) 

CENV AR=FLCAT( CF2 ) 

STDE V=( NLI^ VAR /DEN VAR )*=«‘0.5 
CALL PARCCP(SPJ,K,STOEV,NOF) 

RETURN 

END 

c- 

SUBROUT INE CO CHRN (NR CW, NCOL , RCCT , I P ,RP ,NOF) 

C THIS SLBROLTINE TAKES ROwAND COLUMN COUNTS RCCT 

C FOR SUCCESSES, DERIVED IN CELLER, AND PERFORMS THE 

C COMPUTATICN OF THE CHI SQUARED STATISTIC 

INTEGER SCJSQ ,SRISQ, RCCT (50) ,NSU CC, NROW , N COL , NUM , 
*NOF, IP( 50) ,DENOM,DF 
REAL CHI,RF(5C) 

NSUCC=0 

SCJSQ=0 

SRISQ=0 

DO 10 J=1,NC0L 
IC=J+NRCW 

SCJSQ = SCJSC+RCCT( 10**2 
10 CONTINUE 

CO 2 0 1 = 1, NROW 

NSXC = NSUCC+RCCT( I ) 

SRISQ=SRISC+RCCT( I )**2 
2 0 CONTINUE 

WRIT£(NGF ,101 ) SRISQ , SCJSQ, NSUCC ,NCOL 

101 FORM AT( lOX, '**COCHARAN TEST***, /IX, 

**SUM OF THE SCUARES: R(I)',I5,*j C ( J ) * , I 5/lX , ‘FOR ' , I 5 
* • SUCCESSES IN', 15,* TREATMENTS') 

NUM= (NC0L*(NCCL-1 )*SCJSQ )-( ( NCQL ’ 1) *NSUC C*NSUCC ) 
D£NOH=(NCCL*NSUCC) -SRISQ 
CHI=FL0AT(NUM)/FLCAT (DENOM) 

CF=NC0L-1 

WRITE(N0F ,102 ) NUM ,D ENOM ,CH I ,DF 

102 FORMATdX,' NUMERATOR ,OENOMI NATOR: ', 215/ IX, • CHI ', 
*'3GUARE0 VALUE:* ,F10 .3/lX'DEGREES OF FRE E COM : ' , I 5 ) 
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PETURN 

END 

c-/-/-/-/- 

SUBROUTINE RCCNT ( RCC T ,NRGW, NCOL , NCFl 
C THIS SUBROUTINE PERFORMS THE CHI-SQUATEC TEST ON CELL 
C COUNTS GIVEN EY THE VECTOR RCCT (ROW C COLUMN CELL COUNT) 
INTEGER RCCT ( 50) f NRO W ,NC CL , NCF ,M AX , RC ( 50 I tCC( 50) ,0F, 
«ULtLL 

REAL EIJ,CI-I 
MAX=NROW 

IF (NCOL. GT.NRCW) MAX=NCOL 
CO 10 1 = 1, MAX 
RCd ) =C 
CC(I )=C 
10 CONTINUE 

WRITE(9,10S) NROWtNCOL 
WRIT E(NOF, 102 ) NRCW, NCOL 

102 FORMATdX,* IM RCCNT: NROW, N COL : • ,2 1 5 , • RCCT:*) 

CO 15 1 = 1, NROW 

LL=1+NCCL* (I-l ) 

UL=NCCL=*I 

WRITE(S,102) ( RCCT(K ) ,K=LL,UL ) 

WRITE (NCF, 103 ) (R CCT ( K ) , K = LL , UL ) 

103 FORMATdX, Ifcl 5) 

15 CONTINUE 

N=0 

CO 20 1 = 1, NROW 
DO 22 J=l, NCOL 

K = J + ( I-l ) ♦NCCL 
CC(J) =CC( J)+RCCT(K) 

RC(I)=RC( I)+RCCT(K) 

N=N+RCCT(K) 

22 CONTINUE 

2 0 CONTINUE 
CHI=0.0 

CO 30 1 = 1, NRCW 
DC 32 . = 1 ,MCOL 

EIJ=( FLOAT(CC( J )*RC( I ) ) ) /FLOAT(N ) 

K = J + ( I-l )*NCGL 

CFI=CFI+( (FL CAT ( RCCT (K) ) -£I J ) «*2 ) /E I J 
WRITE(S,67t) I ,J, K,EIJ,RCCT(K ) 

6 78 FORMAT (IX, • I , J ,K, E IJ , CIJ : • ,3 I 5, 2F9 .3) 

32 CONTINUE 

30 CONTINUE 

DF=( NR0W-1)*( NCOL-1) 

WRIT£(9,1C1) CHI,DF 
WRIT E(NOF ,101 ) CHI, OF 

101 FORMATdX,* CFI-SCUARED VALU £* , F 10.3 , 

* • DEGREES OF FREEDOM*, 15) 

PETURN 

END 

C-/-/-/-/-/-/-/-/-/-/-/-/- 

SUBROUTINE GO FIT ( LIS T ,L PTR, I P, RP , NOF ) 

C THIS GROUP Cr SUBS FINDS PARAMETERS IF NECESSARY AND 
C COMPARES AN ACTUAU DISTRIBUTION TO A THEORETICAL CIST. 

C AVAILABLE THEORETICAL DISTRIBUTIONS ARE NCRMA L, UN IFORM, 

C EXPONENTIAL. SAMPLE SIZE 15 OR MORE DESIRED. 

C IP(1)=# CIST; IP(2)=# ESTIMATED PAR AMS ; I P ( 3 ) =N OF ERLANG 
C RPd ) = MU, A, LAMBDA; RP ( 2) =VAR , 6, A LPHA ; RP(3) = CCNF COEF 

REAL RP( 50, LIST (400 ) ,C UMP( 8 0 ) , P ARTV ( 80) ,CHI , MU, VAR 
INTEGER LFTR( 17) , N ,E J (SO ) ,£S TPAR , NOI ST ,N C ELL , NOF , DF, 
*IP(50),RCCT(8C) 

WRIT 5(N0F ,523 I 

523 FORMATdX, •<‘*GOODNESS OF FIT TEST (Chi SGUARE)***) 
N=LPTR( 2 )-l 
CHI=0.0 

CALL CPVGENIN ,CUMP,c J,NC£LL) 
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CALL MUVAF { LI «T,N ,MU ,VAR » 

RP (4 )=MU 
RP (5 )=VAP 
\DIST=IP (1) 

IF (NOIST.EC.l) 

* CALL iNCFMD(CUMP,I PfRP,PARTV,LIST,N,NC£LL,NCF) 

IF (NCIST.cQ.2) 

* CALL UMD(CUMP,iP,RP,PARTV,LIST,NfNCELL,NOF) 

IF ( NOIS7.EQ.3) 

* CALL EXPC(CU^P, IP,RP,PARTV,LlST,NtNCELL,NOF) 

IF (NDIST,EQ#AI 

* CALL EBLu(CUMP,IP ,RP,PARTVtLI ST,N,NC£LL,NCF) 

IF (NDIST.EQ.5) 

* CALL WEIDICUMP, IP ,fiP,PARTV, LIST, N,NCELL,NCF) 

CALL CELCNKRCCT ,LIS T ,N ,NCELL,PARTV I 

V^RIT E(NOF ,121 ) 

121 FORMATdX,' CELL NR, EJ(J» , 0(JI , PARTITION VALUE 

* 27(*-' )) 

CO 25 I=1,NCELL 

WRIT£(NCF, 122) I , E J( I ) ,RCCT ( I I 

122 FORMAT* 1X,2( 15,' T* ) ) 

I F*I .LT.NCELL ) WRITE (NOF,123) PARTV(I) 

123 FORMAT* IX, 3( • |*),F10. 3) 

25 CONTINUE 

WRIT£(NGF,124) N, N 

124 FORMAT*27* )/9X,2I8) 

CO 10 I=1,ACELL 

CHI = CHI + FLCAT * *EJ ( D-RCCT (I ) ) #*2 ) /FLOAT* £J *I ) ) 

10 CONTINUE 

CF=NC£LL-IF*2 )-l 
WRIT£*9,101) N,NCELL ,CHI ,DF 
WRIT E(NOF, 101 ) N,NCELL,ChI,DF 

lOi FQRMAT*lX,‘WnH SAMPLE LENGTF',14,* C IV I 0 ED INTO*, 13, 
*• CELLS'/IX, 'CHI-SQUARED VALUE 0 F * ,F 1C. 3 / IX , * DF = * , 1 5 ) 
IF*IP(2) .EC.O ) WRITE (NOF, 112 ) RP(4),RP*5) 

112 FORMAT* i>/lX, • SAMPLE MEAN AND VARI ANCE: * ,2Fi0.4) 
RETURN 
END 
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C AND 
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SUBROUTINE CPVGEN*N, LUMP , EJ , NCEL L ) 

SUBROUTINE FINOS THE EXPECTED CELL CCUNT VECTOR EJ 
THE THE CUMULATIVE PROBAEILITY VECTOR CUMP 
REAL LI ST *400 »,CUMP* 80) ,PARTV*80 J,ND5,FN 
INTEGER LPTR* 17) ,N,E J*80) ,ES TPAR ,NDI ST ,NC ELL , NOF ,IND 5 , 
CUM*80 ) , INC EL,MER, REM 
FN=F LGATtN) 

IF *N.GT.52) GO TO 23 
N05=FN/5 .0 
IND5 = IFn<<ND5 ) 

BEM=N-* 5 * IND5 ) 

NCELL=INC5 

IF *REM.GT.2) NCELL=IND5+1 
CO 12 I=1,NCELL 
£ J*I ) =5 
CGNT INUE 

IF (REM. EC.O) GO TO 20 
I F*REM.LE.2) EJ*1 ) =6 
IF*REM.GE.2) EJ*1)=4 
IF*REM.EQ.2) EJ*NCELL)=6 
IF*REM.£Q.3) £J*NCELL)=4 
CONT INUE 
GO TC 29 
NCELL=10 

INCEL=IF IX *FN/10 .0) 

REM= N-10*INCEL 
MER= 10-REM 
CO 24 1=1,10 



189 



I F ( I .LE.MER) EJ( I J=INCEL 
IF (I.GT.MERI EJ<n=INCEL+l 
24 CONTINUE 

29 C'JM< 1)=E J (1 ) 

CUMP (1) =CLI«(1 )/FN 
CO 30 I=2,NCELL 

IM1=I-1 

CUM( n = EJ ( 1I + CCM< IMl ) 

CUMP( I) = FLCAT(CUM( I) )/FN 

30 CONTINUE 

WRITE (9,101) N.NCELL 

101 FORMATCIX,* CLMP GENERATOR FOR N=',I5, 

* • )#CELLS • ,15 ,* CLMP VECTOR:*) 

WRITE(9,102) (CUMPd ) ,I=1,NCELU 

102 FORMdTdX ,8F7 .3) 

RETURN 

END __ __ _ _ 

SUBROUTINE NO FMO ( CUMP ,I P ,RP , PART V , LI ST ,N , NC ELL, NCF) 

C THIS SUBROUTINE FINOS THE PARTITIONS VALUES FCR DIVIDING 
C THE SAMPLE I NTC CELLS, BASED ON THE CUMULATIVE PRCB VECTOR 
C CUMP THE PARTITION VECTOR PARTV IS NCELL-1 LONG 

REAL LIST(400 ),CUMP( 80) , PART V( 30 ) ,RP ( 50 ) , STDE V,MU, VA R 
INTEGER N,NCELL,IP(50) 

WRIT E(NOF, 524 ) IP(2) 

524 FORMATdX, ‘NCFMAL 0IST*,I3,* PARAMETERS ESTIMATED*) 

IF { iP(2 ).£Q.C) GO TO 5 
MU=RP (4 ) 

VAR=RP(5) 

GO TO 10 
5 NU=RF(1) 

VAR=RP(2 ) 

10 STDE V=VAR««0. 5 
NCM1=NCELL-1 

WRIT£(9,ili) NU, VAR, STOEV,NCd 
111 FORMATdX,* MU ,VAR ,STDEV ,NCM1 * ,3F 8.3,15) 

CO 20 I =1 ,NCM1 

CALL NCBMV (2,Z ,CUMP( I) ) 

P ARTV ( I ) = STDEV«Z+MU 

WRITE (9,121) I , CUMPd ) ,Z,PARTV(I) 

121 FORMATdX, *I,CUMP (I) , Z ,P ARTV( I ) * , 1 5 ,3 F 8. 3 ) 

20 CONTINUE 
RETURN 
EN0_ 

SUBROUtFnE CELCNT(RCCT, list ,N,NCELL, PARTV) 

C THIS ROUTINE SORTS THE SAMPLE AND FIGURES THE CELL COUNT 
C OF THE SAMPLE CORRESPONDING TO THE EJ*S OF THE THEORETICAL 
C DISTRIBUTION 

REAL LIST(^00 ), PARTV (80 ) 

INTEGER RCCT(EO) , N ,N CEL L , KEY (400 ) 



5 

6 
8 

10 



J=1 

CO 5 1 = 1 ,N 
KEY( I ) = I 
CONT INUE 

CALL SHSCRKLJST ,KEY ,N) 

00 6 I=1,NCELL 
RCCT ( I )=0 
CONTINUE 
NCMl=NCELL-l 
CO 20 1 = 1, N 

IF(LIST(I ) .LE.PARTV(NCMl) ) GO TO 10 
RCCT( NCELL)=RCCT(NC£LL)+1 
GC TO 20 
CONTINLc 

IF(LIST( I ).LE.PARTV( J)) GC TO 15 
J=w+1 
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IF (J.LT.NCMl ) GO TO 10 
IF (J. EQ.NCM ) GO TO 8 
15 RCCT ( J ) = RCCTl J )+l 

20 CONTINUE 
RETURN 

END _ __ _ 

SU6R0UTI NE UNIOI CUMP , I P t RP, PART V , LI S T ,N, NC£LL,NQF) 

C THIS SUBROUTINE FINDS THE PARTITIONS VALUES FOR DIVIDING 
C THE SAMPLE INTC CELLS t BASED ON THE CUMULATIVE PRC8 VECTOR 
C CUMP THE PARTITIG^ VECTOR PARTV IS NCcLL-1 LCNG 

REAL LISTIAOO ),CUMP( 80 ) , P ART V( 80 J ,RP ( 50 ) ,STD£V,A,B,C 
INTEGER NfNCELL, IP(50) 

KRITE(NOF,52A J IP(2} 

52A. FORM AT( IX, *UN IFORM D 1ST , * ,13 , * PARAMETERS ESTIMATED') 
A=RP (1) 

E=RP (2) 

C=S-A 

.NCMi=NC£LL-l 
CO 20 I=1,NCM1 

PARTV(I)=CLMP( I)«C+A 
20 CONTINUE 
RETURN 
END 

SUBROUTINE EXPO! CUMP , IP, RP, PARTV ,LIST,N, NCELL ,NQF ) 

C THIS SUBROUTINE FINDS THE PARTITIONS VALUES FOR DIVIDING 
C THE SAMPLE INTO CELLS, BASED ON THE CUMULATIVE PRCB 
C VECTOR CUMP THE PARTITION VECTOR PARTV IS NCELL-1 LONG 
REAL LIST(AOO) ,CUMP( 80) ,PARTV(80 ) ,RP (50) , LAMBDA 
INTEGER N, NCELL, IP(50) 
k>RITE(9 ,524) IP<2) 

524 FORMAT! IX, 'EXFONENTI AL DISTR IBUT ION , • , 13 , 

PARAMETERS ESTIMATED') 

LAMBDA=RP(1) 

IF (IP(2).£Q.C) LAMBDA = 1.0/RP( 4) 

NCMl =NC£LL-i 
CO 2 0 1 = 1, NCMl 

PARTV ( I )=( AlOG(1-CUMP( I ) ) )/(- LAMBDA) 

20 CONTINUE 
RETURN 
END 

SUBRCUTI NE £RLO( CUMP , I P , RP , PART V , LI S T ,N, NCELL ,NOF ) 

C THIS FINDS THE X TO CCRRESPONED TO THE F CF CLMP 

C BY NEWTON-RAPFSQN SEARCH IN ERLNUT, TO YIELD PARTV 

REAL RP( 50 , LIST (400 ) ,CUMP( 8 0) ,PARTV( 80) ,CHI ,DN,F,X, 
«LAMBOA,FN 

INTEGER LPTR( 17) , N , E J (8 0 ) , ES TPAR ,NOI ST , NC ELL , NCF , OF , 
♦IP(50),RCCT( aC),FACT ,NCM1 
NCMl =NCELL-1 
FN=FL0AT(IF(3 )) 

LAMBDA=RP(1) 

IF(I P(1 ) .EG.l ) L AM6DA=FN/RP( 4) 

NN=I P(3) -1 
FACT=1 
CO 5 1 = 1 ,NN 
FACT=FACT#I 
5 CONTINUE 

FN=FLOAT(FACT ) 

CO 10 1 = 1, NCMl 
F=CUMP( I ) 

CALL ERLNUT(F ,X,NN,FN, LAMBDA) 

PARTV (I) = X 
10 CONTINUE 
RETURN 
END 
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SU8RQUTI N£ ERLNUT (F, X »NN,FN, LAMBOAl 
C THIS IS THE NEWTON-RAPHSON TECH WHICH TAKES AN F AND 
C RETURNS AN X; NN=^-l ; FN=(N-1» 

REAL F, X, FNt LAMBDA, XLO, XHI , XMED , NF ,0 1 F ,A C IF , S LOPE , 

♦ L,LX ,LXN,FACT 
INTEGER NN,N 
NF=FLOAT (NN ) 

EPSI LN=0,0001 
L=LAMBDA 
X=(N F+1 .C )/L 
IF(F.GT,0*7J X=X<‘2*F 
IF(F.LT.0.3) >=X*3*F 
XL0=0 

XHI=100. C«X 
CONT INUE 
LX=L*X 
L XN=0 
FACT=1 

00 15 = 

FACT=FACT*FLCAT( J) 

LXiN=L>N+{ LX»*J) /FACT 
CONTINLE 

FX=i.O-({L>N<-l.)*EXP(-LX) ) 

DIF=F-FX 
AOIF=AeS(D IF» 

SLOPES (L«( LX«*NN) ♦EX?(-LXn/FN 
IFIDIF.GT.CJ XLG=X 
IF(DIF.LE.C) XHI=X 
X=X+DIF/SLCPE 

1 F(( X,LT*XLCi .OR. (X.GT.XHD) X=0 . 5* I X LC+XH I J 
WRITS (9, 10 I) F,FX ,DI F , SLOPE , XlO , X ,XHI 

FORMAT! • F,FX , OIF, SLOPE ,XLO, X,XH I: * , 3F 11 . 7/ 20X, 4F11. 7i 
IF(AOIF. GT.EPSILN) GO TO 10 
RETURN 
END 



SUBROUTINE WEIOICUMP, IP , RP , P ART V , L I ST ,N, NCELL,NOFJ 
C THIS SUBROUTINE FINOS THE PARTITIONS VALUES FOR CIVIDING 
C THE SAMPLE INTO CELLS, BASED ON THE CUMULATIVE PRCB 
C VECTOR CUMP TFE PART ITION VECTOR PAR TV IS NCELL-1 LONG 
REAL LI ST(4C0 J ,CUMP( 80) ,PARTV(80 ) ,RP (50) , LAMBDA, 

* APOW,ALPF,F, > 

INTEGER N,NCELL, IF(50) 

WRITE(9,52A) IP(2) 

524 F0R)-IAT(1X, ‘WEIBULL D ISTR IBUT ION , • , 13 , 

* • PARAMETERS ESTIMATED') 

LAMB0A=RP(2) 

AP0W=1.0/FF(1 ) 

NCM1=NC£LL-1 
CO 20 I=1,NCM1 
F=CUMP ( I ) 

X=( (ALOGd. C-F) )*^APOW) /(-LAMBDA) 

PARTV ( I )=X 
20 CONTINUE 
RETURN 
END 

c- /-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/- /- 

SUBROUTINE WILC0X(LI ST,LPTR, IP,RP,NOF) 

C THIS USES RANQUE TO RANK THE ABSOLUTE VALUES OF 
C THE 1ST TREATMENT MINUS THE 20 TREATMENT, THEN SUMS THE 
C RANKS OF THE PCS DIFFERENCES AND THE NEG DIFFERENCES 
C TO GET A T VALUE. THE VAR SHIFT ALLOWS A CONSTANT TO BE 
C ADDED TO THE 1ST TREATMENT, E.G. , TO SEE IF THE 1ST TREAT 
C PLUS A CONSTANT IS SIGNIFICANTLY LARGER THAN THE 2D TREAT 
REAL LiST(400 J,RP (50 ) ,A0IF(200) , SVALS (40 0 ) , RANK ( 400 ) , 

* SRPOS,SRN£G, SRANK,SRSQ ,P,PR ,Z,ALPH1 , ALP H2, AL FMI N, 



10 



15 



101 
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*CIF, ALPHATfSHJFT 

INTEGER LFTR( 17) , KEY ( 400 ) , I P ( 50 I , NOF , Q ( 2 00» , 

* NNEG,NPCS,NOCiF ,POSDIf (200) ,IR 
SRANK=O.Q 

SRSQ =0. 0 

EPSI LN=0.CC01 

IR=0 

NNEG=0 

SRPOS=0.0 

SRNEG=0. 0 

NP0S=0 

NVALS=LPTF(3)-1 
NBL0CK=LFTP(2 )-l 
SHIFT=RP ( 1) 

CO 10 I=1,^BLCCK 
K=I+NBLCCK 

0IF=L1ST(K )+SHIFT-LIST(I ) 

ABDIF=A6S( CIF ) 

I F(AB0IF.L7.£PSILN) GC TO 10 
I R=IR+1 

IF(OIF,LE,C) GC TC 13 
NPCS=NF0S+1 
PCSCIF (IR)=1 

13 I F(OIF.GE.C ) GC TC 15 

NNcG=KNEG*l 
FCSDIF(IR)=-1 
15 ADIF( IR)=A£S( OIF) 

10 CONTINUE 

NOOI F=NBLCCK-^POS-NNEG 

CALL RANCUE( IFfAOIF, SVALStRANK,Q ,KE Y , 1 ,N3 LOCK ,1 ) 
V<RITE(9,211) (AOIFd )tI = l,IR ) 

V»RIT£(9,211) (RANKd ) ,1=1 ,IR) 

211 FORM AT( IX, 'DIF/RNK:* t 7F8.2) 

CO 20 I=1,N6LCCK 

P=FLOAT(POSOIF(I) ) 

IF(P.GT.O.l) SRPOS=SRPOS-*-RANK (I ) 

I F(P.LT.-O.l) SRN£G=SRNEG+RANK(I ) 

PR=P#RANK( 1) 

SRANK=SPANK+PR 
SRSQ=SRSQ+PR*PR 
ACIF( I ) = A0 IF( I )*P 
20 CONTINUE 

WRIT E(NOFt 101 ) NBLCCKtSHIFT,NPCS ,NNEG ,NOC IF , SPPOS , 

» SRN E G 

101 FORMAT( 10X,*«=>wILCOXCN SIGNED RANK TEST#**/1X. 

* 'IN*, 14,* PAIRS ,•/! X,* WITH 1ST VARIABLE SUBTRACTED', 

FROM THE 2C PLUS A SHIFT 0 F' , F 9 .3/ IX , • THERE ARE', 

* 14,' PCSITIVE DIFF ERENCES , • ,14 , • NEGATIVES, AND'/ 

* 5X,I4,' TIE (S) • /IX , 'SUM OF POSITIVE RANKS :• ,F1 0.2/ 

« IX, 'SUM CF NEGATIVE RANKS FI 0.2 ) 

IF ( N6L0CK.lt. 20) GO TO 50 
2 = SR ANK/ (SRSC<*0.5) 

CALL NORMVd , Z,ALFH1 ) 

ALPH2=1. C-ALPFl 
ALFMIN=ALFH1 

IF<ALPH1.GT.ALPH2 ) ALFMIN=ALPH2 
IF(HYP.EQ.i) /LPHAT=2.0«ALFMIN 
IF(HYP.EC.2) ALPHAT=ALPH1 
IF(HYP.EC.3) /LPHAT=ALPH2 
ALPHYP=1 .C-RP (2) 

WRITE (NCF,102) SRANK ,SRSG, Z , ALPhAT , ALPHYP 

102 FORMAT( IX, •SU^ OF SIGNED RANKS:' ,F10.2/ 

SUM OF SQUARED SIGNED fi ANK S: ' , F 10 . 2/ • T:',F10.3/ 

ALPHA HAT:' , Flo. 3, 3X, 'HYPOTHESISED ALP HA: ' , FIO .3 ) 
IF(HYP.EG.l) WRITE ( NOF , 131 ) 

IF(HYP.£C.2) WRITE ( NCF, 132) 

IF(HYP.£G.2) WRITE (NOF, 133) 



193 



131 FCRMAK IX, *H0 : E(X) = E(Y), HI: E(XK>E(Y)M 

132 FORM AT( IXt'HO : E(XJ>=£(Y), HI: E(XXE(Y)') 

133 FORMAK IX t'HO : E(XX=E(Y), HI: EU)>E(Y)') 

IF (A LPHYP.GT. ALPHA!) URITE (N0F,121) 

IF(A LPHYP.LE. ALPHA!) V^RI!£ (N0F,122) 

121 FORMA!(iXt‘ REJEC! !H£ NULL HYPO!hESIS‘) 

122 F0RMA!(1X,* DC NO! REJEC! !HE NULL HYPO!HESIS') 

GO !C 95 

50 V<RI!E(NOF ,103 ) 

CALL RANGUE(NELOCK,ADIF, SVALS,RANK,Q ,KEY , i,NBLOCK,i) 
URI!E(NOF ,104 ) ( S VAL S ( I ) , 1=1 ,NBLCCK) 

103 FORMA!( iX, 'BECAUSE OF SMALL SAMPLE SIZE, !ABL£S •/ 

* IX,' SHCULO EE USED. !HE DIFFERENCES ( NC! !hE'/ 

# 1X,'ABSCLL!E DIFFERENCES) FOLLOW IN RANK ORDER: •) 

104 FORMA!( 1X,5F1C.3) 

95 R£!URN 

END 

C-/-/- 

SUBROU!INE MEDIA ( N! REA! , LI S !, LP !R , N I P , I P ,N0 F ) 

C !HIS SUBROU!I.N£ FINOS !HE MEDIAN VALUE CF A NUMBER OF 
C VARIABLES/!REA!MEN!S, FINDS CELL COUN! OF #A8CVE/8EL0W 
C GRANC MEDIAN IN EACH IRE A!, AND PERFORMS A R*C CONIiNGENCY 
C !ES! ON !HA! CELL COUN! MAIRIX, USING FORM P.172 CONNCVER 
C IN ORDER !0 ALLOW GROUPING CF VARIABLES IN!0 AGGREGAIE 
C VARIABLES; !HE AGVARI VECIOR CONIAINS !HE INDEX CF AGVAR 
C FOR EACH VAR (E.G., VAR 1 S3 IN !0 AGVAR 1, VAR 2,4,5 IN!0 
C AGVAR 2; !HE AGVARI IS 1,2, 1,2, 2). !HIS INFORMAIICN IS IN 
C CONIAINEO IN !HE IP VECIOR. NAGVAR IS !H£ NUMBER CF 
C !HE NUMBER OF AGGFEGAIE VARIABLES! E. G. , HERE, 2) 

C IF AGGREGAIE VARIABLES ARE NOT USED , MP = NUMB £R CF INTEGER 
C VARIABLES=0, NAGV AR=N!RE A!= NUMBER 

C OF VARIABLES IN OAIA SE! ; AGVARI IS 1, 2 , . . . , N TREAT 
C NAV(I) IS THE NUMEER OF ELEMENTS IN AGVAR(I) 

C NA(I) IS 4 ELMS ABOVE MEDIAN IN AGV AR ( I ) ; NB( I ) = J#B ELOW 
C Q50 IS GRAND MED; 

INTEGER MEDl, ME02,NB ( 15) ,NA( 15) , LL,UL,DF ,NAV( 17) , 

*LP!R (17) ,KEY( 400 ) ,AGVARI(17) , NAG VAR, IP(17),NIP 
#A,B, N,N!REA!,NN, AN,AB 
REAL LI ST (400 ),V ,NUM ,!,CONS! ,SVA L ( 40 0 ) ,Q 5 C 
A=0 
B=0 
!=0 

IF ( NIP.GT.O ) GO !C 3 
NAGVAR = MREA! 

DO 2 I=1,NIR£A! 

AGVAR Id ) =I 
NAV( I 1=0 

2 CONTINLE 
GO TO 1C 

3 CONTINUE 

NAGVAR=0 

00 4 I=1,N7REA! 

AGVAR ]( I ) = IP ( I ) 

IF ( IF(1 ).GT. NAGVAR) NAGVAR=IP(1) 

NAV( I )=0 

4 CONTINUE 

10 DO 15 1 = 1, NAGVAR 
NBd )=C 
NA(I ) =0 
15 CONTINUE 

N=LPTR( (NTREA7+1 ) )-l 
MED1=IFI X(C.5^FL0AT( N+i ) ) 

MED2=IFI X( 0. 5<FL0AT( N + 2 ) ) 

CO 18 1=1, N 
KEY( I ) = I 
SVAL( I)=LIST( I ) 

18 CONTINUE 
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CALL SHSCFT(S VAL tKEY ,N) 

Q50=0.5#( SVAL (ME01)+SVAL« ME02) < 

C WRIT£(N0F,111 ) M£Dl«ME02.SVAL<MEDi) ,SVAL(M£D2)tQ50 

cm FORMATi 1 X, 'M£C1,M£D2 * ,2 1 5 , ' S VAL ( 1 ,2 J , MED JAN' .3F10.3) 
DO 30 I=1,NTREAT 
LL=LPTF (I ) 

UL=LPTR( (1+1) )-l 
K = AGVAR Id J 
N AV( K )=NAV (K) +UL-LL + 1 
DO 20 J = LL fUL 
V=LIS1(J ) 

IF(V,GT.Q50) NA(KJ=NA(K) +i 
IF(V,L£.Q50) N6(K)=N8(KJ +1 

CONTINLE 
CONTINUE 

CO 35 I=1,NAGVAR 
A =A+NA ( I ) 

6=B+Nem 
CONT INUE 
NN=N*N 
A8=A«B 

CONST=FLCAT(N^)/FLOAT(AB ) 

CO 50 I = 1,NA6\(AR 

IF (NAVd >• EQ, 01 GO TO 50 
AN=A*NAV ( I J 

NUM=( FLCAT fINAm » l-FLOAT (AN I /FLOAT (N ) )**2 
T=T♦Ca^ST«^UM /FLOAT! NAV( I ) i 
50 CONTINUE 

CF=NAGVAR-1 

i^RIT E(NQF,101 ) Q50,NAGVAR 

101 FORMAT! IX , 'THE MEDIAN VALUE CF THE TEST I S* , F 10 • 3/ IX , 
#'THE CELL COUNTS OF THE', 13, 

*• TREATMENTS CR AGGREGATES ARE:'} 

WRITE !NCF,102) ! I ,I =1 , NAGV AR ) 

102 FORM AT! 13X,16 14J 
WRIT £ (NO F, 104 )!NA!I) ,1=1, NAG VAR) ,A 



20 

30 



35 



104 FORMAT! IX,' > MEDIAN 



' ,1614) 



103 

105 



WRITE!N0F,103 )(N8!I) ,1=1, NAG VAR) ,8 
WRIT c!NOF ,105 ) !NAV( I ) , I =1 , N AGVA R ) , N 
FORMATdX,' =< MED IAN' , 161 4) 



FQRMATdX,' TCTAlS 



' ,1614) 



WRITE !NCF,106) T,DF 
106 FORM AT! IX/IX, 'CHI-SQUARED VA LUE ' , FIO . 3/ 

*1X,' DEGREES OF FREEDOM', 15) 

RETURN 

END 

C-/-/- /-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/- /- 
SUBROUTINE KRLWALILI ST,LPTR, IP,RP ,NOF ,NTREAT , TREAT) 

C THIS SUMS IHE RANKS OF EACH TREATMENT AND 

C THE SQUARE OF ALL RANKS TC FIND THE OVERALL CFI SCUARE 
C STATISTIC FOR THE NULL HYPOTHESIS THAT ALL TREATMENTS 
C HAVE AN EQUAL MEAN. IF REJECT THAT NULL, THEN THE 
C ADDITIONAL NUMBERS COMPUTED HAVE SOME MEANING 

REAL TERM2,R, SSR,SSQ ,T, LI ST (400) ,RP ! 50) , RANK! 400) , 

* SRIDNI ,C(20C),FN(17),SVALS!400 ),RNI(15) 

INTEGER LPTR! 17) , IP! 50) ,NCF , TREA T ! 15 ) ,NT R EAT , N J ! 17) , 

♦ N,UL,LL,K£Y( 400 ) ,DF1,DF2 
N=LPTR! (N'TREAT+1) )-l 

OF 1=N-1 
CF2=N-NTREAT 

CALL RAN CUE !N ,LI ST,SVALS ,RAN K ,Q , KEY , NTRE AT, 0 , 1) 

CO 10 I=i,NTREAT 
RNI! I )=0.0 
10 CONTINUE 
SSR=0.0 
SRIDNI = 0 .0 
CO 20 I=1,NTREAT 
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LL=LPTRm 
UL=LPTRlil4i) )-i 
N J(I ) =LL+1-LL 
FN(I ) =FL04T(NJ(I) ) 

OG 30 J=LL,UL 
R = ?ANI<( J ) 

RM (I J=RNHI )+R 
SSR = SSR+R=»R 
30 CONTINUE 

SRIONI = SRICNI + {RNI (I)*RNI (I) ) /FLOATCNJU ) i 
20 CONTINUE 

TERM2=FLC;iT(N=»(N + l (N+1 > )/4,0 
SSQ= (SSR-TSRM2)/FL0AT(N-1) 

T=(SRi0NI-T£R.*'2)/SSQ 

WRITE (NCF.lOl) NfNTREAT, SSR 

101 FORM AT( 10;<f •«=»KRUSKAL-WALLIS TES T=«'=«' * / IX, • IN SET OF', 
*15,' DATA kE^ENTS, DIVIDED INTO', 15,* SAMPLES'/IX, 
*'THE SUM CF SCUAREO RANKS WAS*,F10.3, 

*« THE SUMS OF RANKS/ SAMPLE: • / 

«/lX* VAR NR‘,5X,‘NR DATA ELEMS * , 5X* SUM OF RANKS') 

CO AO I=1,NTREAT 

WRITE (NCf,102) TREAT (I ) ,NJ< I», RNK I ) 

102 FORMAT( iX,I5, £X, I5,12X,F8.2) 

40 CONTINUE 

WRITEiNOF ,103 ) T,CF1,DF2 

103 FORMAT( iX/lX, 'YI ELD A CHI SQUARE VALUE OF :' ,F 10. 3/iX , 

=*= 'WITH', 15,* CEGREES OF FREEDOM. IF AND CNLY IF'/IX, 

* 'NULL HYFCTHcSIS REJECTED, FOL NUMBERS CAN'/iX, 

* 'TEST WHETHER SPECIFIC PAIRS OF VARIABLES HAVE'/iX, 

* 'THE SAME MEAN. IF A8S DIFFERENCE IN MEAN RANKS '/IX, 

- ' iR (I) /MIJ-R(J) /N( J) j > POCLED STANDARD DEVIATION'/ 
»1X, 'TIMES THE T QUANTILE l-ALPHA/2 THEN REJECT THE'/ 
*1X,' HYPOTHESIS THAT THE PAIR OF MEANS ARE EQUAL'/IX, 

* ' (DEGREES CF FREEDOM HERE : ' , I 4 , ' ) ' //3 X , ' I ', 

«* J A8S DIF MEAN RANK POOLED STANDARD DEVIATION') 
CONST=( SSC^^*! FLOAT (N) -1. 0-T) / FLOAT (DF 2 ) )**0.5 
NTM1=NTREAT-1 
DO 60 1=1 ,NTM1 
I P1=I +1 

DO 70 J=IP1,NTREAT 

ACMR = ABS ( (RN I (I)/FN( I » ) - (RNK J) / FN( J ) )) 
SCPCOL=CONST«( ( i.O/FNd ) i + { 1 . 0/FN ( J ) i ) 

WRITE (NOF, 104) I , J , A OMR , SDPOOL 

104 FCRMAK2X ,21 3,6X,F3. 3,14X,F10.3) 

70 CONTINUE 

60 CONTINUE 
RETURN 
END 

C-/-/-/-/-/-/ /- 
SUBROUTINE SQLARK { L I ST , LPT R ,NTR EAT , TREAT ,I P , RP, NOF) 

C THIS TESTS WHETHER A DATA SET HAS EQUAL VARIANCE AMONG 
C ALL VARIABLES BY (1) FINDING THE ABSOLUTE DEVIATIONS OF 
C EACH. VAR lABLE 'S ELEMENTS, IN SUBROUTINE ADEV , (2) RANKING 
C THOSE DEVIATIONS (RANCUE), AND (3) S UMM ING , SQLAR I NG , ETC. , 

C THOSE RANKS AS RECUIRED TO FIND THE TEST STATISTIC 

REAL LI ST (400 ),OEV(400) ,RP( 5 0) , F N ,FM ,FNT , R( 400 , 

* SUMRJ(17) ,FNJ(17 ) ,R2 MU, RMU, T 1,T2 ,NUM , fiSQ , SSSR2J , 
*SUMR2J( 17) 

INTEGER LPTR( 17) , NTR E AT , TREAT ( 1 7 ) , I P ( 50 ) , NOF , KE Y ( 400 ) 
NT=LPTR( (NTREAT<-1 ) )-l 
FNT= FLOAT (NT) 

CALL ADEV (LIST, LPTR, NTREAT, DEV) 

CO 3 1 = 1 ,NT 
KEYl I ) = I 
3 CONTINUE 

CALL RNQ(D£V, HEY ,NT, R) 

CO 5 J=1 ,NTREAT 
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10 

101 

102 

40 

50 

103 

104 

95 



12 

30 



SUMR2J( JJ=C 
SUMRJ ( J)=0 
CONTINUE 
SUMR=0.0 
SUMR2=0.0 
SUMR4=0. C 
J=1 

CO 10 1 = 1, NT 
RSQ=R(I)<‘R(i) 

SUMR = SUNR + F(I ) 

SUMR2=ScMR2+RSC 

SLMR4=SLMR^+RSC*RSQ 

INCRJ=LFTR((J+1J) 

I F(I .GE.INCRJ ) J=J+1 

SUMRJ (J) =SUMRJ( JJ+R( I) 

SLMR2 J(J »=SUMR2J( J) +RSQ 

CONTINUE 

R2MU=SUMR2/FNT 

RMU=SUMR/FNT 

IF ( NTR5AT.GT.2) GO TC 40 
FN=FLGAT(LFTR(2J-1) 

FM=FLCAT(L f TR ( 3 )- LPTR ( 2) ) 

0TER.M2 = FN#FM/<FNT-1. 0) 

0TERM1=CT£FM2/FNT 

0£N0M = (CTEFM1*SUMR4-DTERM2★R2MU«R2^<U) =»*0,5 

NUM=SUMR2J (1)-FN*R2MU 

Tl=NUi-/CENCM 

WRITE (5,101) 0TERM1,0TERM2,SUHR4,R2MU ,SUMR2J( 1) 
FORMAT! IX, • DT ERMl -2 , SUMR4 , R2MU , SUM J2. 1S5F5.4) 
WRITE(NCF,102) T1 

F CRMAT( iOX , SQUARED RANK TEST FOR *, 

« ‘EQUAL VARIA^CE =»♦«**/ 1 X ,' T EST STATISTIC *, 

* 'FOR THIS TWC SAMPLE DATA SET iS',F10.4) 

GO TO 9 5 

CONT INUE 

0 SQ=( SlMR4-FNT*R2MU=4'R2MU) /(FNT-1. 0) 

SSJMU=C.O 

DC 50 u = l , ^TREAT 

SSSR2J=SUMR2J(J l«SUMR2J( J) 

SSJMU=SSJMU+SSSR2J/FL0AT(LPTR( (J+1) )-LPTR(J> ) 
CONTI NLc 

T2=( SSJMU-FNT=i'R2MU=»R2MU) /OSQ 
WRITE (9, 103) SSJMU,R2MU,SUMR4,DSG 

FORMAT! IX, ' S S JU M , R2 MU , SUMR4 ,OSQ • ,2F 10.4 , F20 . 4 ,F 10.4 ) 
WRIT £!NOF, 104 ) NTREAT, T2 
FORMAT! 1 CX , 

« SQUAREC RANK TEST FOR EQUAL VARIANCE 

« • IN A CATA SET OF • , 13, • VAR lABL ES '/ I X, 

* ' THE TEST STATISTIC :• ,F10. 4) 

RETURN 

END 



SUBROUTINE AO EV ! L 1ST , LP TR ,N TREAT , DEV ) 

REAL LIST!400 ) ,RP !50 ) , DEV !400 ) , V ARX , MU ,X ! 200 ) 
INTEGER LPTR! 17) , K ,N TRE AT ,I P { 50 ) ,NOF , LL , I PI , N I 
CO 10 1 = 1, NTREAT 
I P1=I+1 
LL=LPTR! I ) 

NI=LPTF!IP1)-LL 
DO 12 K = 1,M 

X!K) = LIST! (LL+K-1) J 
CCNTINLE 

CALL ML\(AR !X,NI ,MU,VARX) 

DO 30 K=l, M 
J = LL+l«-i 

C£V!J )=ABS(L 1ST! J)-MU) 

CCNT INUE 



L I=LPTR( n 

WRITE (9,105) I ,LI ,LIST(LI ),MU,uEV{LI) 

105 FORMAK iX, »VAF#, LPTR ;U ST,MU,OEV CF i.PTR : • ,21 5 ,3FiO. 4) 
10 CONTINUE 
RETURN 
END 

C-/-/-/-/-/-/-/ — /-/-/-/- 
SUBROUTINE HAFTLEILI ST, LPTR , NTRE AT , TR EAT ,IP,RF,NCF) 

C THIS SUBROUTINE FINOS THE STATISTIC FOR THE HARTLEY'S 
C TEST FOR EQUAL VARIANCE, AN F RATIO BETWEEN THE SMALLEST 
C AND LARGEST VARIANCE IN A SET OF DATA. 

REAL LI ST (400 ) ,RP ( 50 ) ,X ( 200 ) , VM I N , VMAX, V A RX , F RAT IO,M U 
INTEGER LFTR( 17) ,NTR EAT , TREAT (16 ),IP(50) ,NCF, LL,UL, 

« IPl, NI 

VMIN=103CCQ0.C 
VMAX=-10. C 
CO 10 I=1,NTREAT 
I P1=I-H 
LL=LPTR(I) 

NI=LPTF( IPD-LL 
00 12 K=l, M 

X(K)=LIST((LL+K-1)J 
12 CONTINLE 

CALL MLVAR (X,NI ,MU,VARX) 

IF ( VARX.GE.VMINJ GO TO 20 
VMN=VARX 
I VMIN=TkEAT( I ) 

20 I F(VARX.LE .VMAX) GO TO 10 

VMAX=VARX 
I VMAX=TREAT( I ) 

10 CONTINUE 

FRATIG=VNAX/VMN 

WRIT £ (NO F, 101 ) NTREAT,IVMIN, VMIN , I VMAX , VMAX ,F RAT 10 
101 FORMATdOX, 

« HARTLEY TEST FOR EQUAL V ARIANCE^^f'^#' /2X, • IN' , 

A DATA SET WHICH COMPARED ',13,' VARIABLES, '/IX, 
VARIABLE* ,13,' HAD THE SMALLEST VaR IAN C E: * , f 10 .4, 
*','/lX,' WHILE VARIABLE* , 13, • HAD THE LARGEST ;*, FiO. 4 , 
WHICH GIVES AN F RATIO 0F:*,F1C.4) 

RETURN 

END 

C-/-/-/-/-/-/- /- 
SUBROUTINE CO RRE L ( LI ST , LPTR, NTRE AT , TR EAT ,NOF, IP, RP) 

C THIS SUBROUTINE TAKES LIST, COMPOSED OF TWO EQUAL LENGTH 
C VARIABLES, PEPFCRMS P£ARSON*S RHO ON THEM, THEN RANKS 
C EACH VARIABLE, PERFORMS PEARSCN* S RHC ON EACH TO YIELD 
C SPEARMAN*S RHC, AND IF REQUESTED, FIGURES K£NDALL*S TAU 
C IP(1) IS QPTICN:-1=SPEARMAN CNL Y , 0=P EARSCNCS P EARM AN , 

C 1=PEARS0N, SPEARMAN AND KENDALL 

REAL LIST(400 ),RP(50 ),SVALS( 200) ,RANK ( 40 0 ) , B 1 ,B0 ,TAU , 

* T RANKS! 200 ),TVALS (200) ,SRANQ( 200) ,TRANG(200) 

INTEGER LPTR( 17) , KEY ( 400) ,N TREAT , TRE AT ( 1 7 ) , NCF , 

« TK6Y( 2CC), SKEY(200),0PT, IP(50 ) 

NPAIR=LPTR(2)-1 

IF ( IP( 1 ).LT.C) GO TO 5 

WRITE(N0F,201 1 

201 FORMAT! IQX, •^♦CORRELATION*^' ) 

CALL RH0SR(LIST,NPAIR,RH0,B1 ,B0, 0) 

WRITE(NOF ,101 ) RHC 

101 FORMAT! 'PEARSCN'' S CORRELATION ON THE', 

* • ACTUAL VALLES CF TWO VARS :'/ lOX, F 10.3 ) 

5 00 10 I=1,NPAIR 

K=I+NPA IR 
SKEY ( I ) = I 
TKEY! I )=I 
SVALS ( I ) = L 1ST! I) 

T VALS ! I ) = LIST ( K) 
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10 CONTINUE 

CALL RNQ (SVALS, SKcY ,NPAIR,SRANQ) 

CALL RNQ (TVALS,TKEY,NPAIR,TRANQ) 

00 15 1=1 ,NPAIR 
K=I+NFAIR 
RANK( I )=SR/NQ( I ) 

RANK! K >=TR^NQ( I) 

15 CONTINUE 

CALL RHOEP (RA^K, NPAIR,RH0,81 ,80 t 0) 

V«RITE(NCF ,102 ) RHC 

102 FORM^T( 10X,***CORRELATION*** /IX, 'PEARSON"S *♦ 
'CORRELATICN ON THE RANKS OF TWO vARS', 

*/lX, *, i .£. ,SFEARMaN"S RHC;*,F10.3» 

IF { !P( 1 ).LT. IJ GO TO 95 

N0IS=0 

NC0N=0 

CO 20 1=1, NPA IR 
I S=SKEY(II 
I P 1= I + 1 
I SP1 = IS + 1 
NCCNI S=C 
NCIS IS=0 

DC 25 L=IF1,NPAIR 
J = SKE Y(U 

SCIF=«RANUJ )-SRANQ( IS) 

IF(SOIF.LT.O.l) GO TO 25 

70IF=TRANQ( J)-TRANQ (IS) 

IF(TCIF ,LT .0.1 ) GO TO 28 
NCCN=NCCN+1 
NCCNI S=NC0NIS+1 
GO TO 25 

28 IF{ TCIF .GT.-O. 1 ) GO TO 25 

NO IS=N0IS+1 
NDISIS=NDISIS+1 

25 CONTINUE 

WRITE(9,141) IS,NCONIS,NOISIS 

141 FCRMAT( IX, 'INDEX OF X = *,I5, 

» ' ffCQNCORDANT ,DI SCCNCCROANT' , 215) 

20 CONTINUE 

TAU=2.0=f'FLCAT (NCON-N 01 S ) /FLOAT ( N PA IR* (NP A IR- 1 ) ) 
WRIT£(N0F,142 ) NCCN, NOI S , TAU 

142 FORMATdOX KcNCALLS TAU ****'/lX, 

V TOTAL NUM8EF OF CONCORDANT PAI RS: ' , 15/ IX, 

** TOTAL NUfBEf OF DI SCONCOROANT PAIR S : * , I 5/ IX , 

* • TAU = * , FiC.4 ) 

95 RETURN 

END 

C- /-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/- /- 
SUBROUTINE REGRET (LI ST,LPTR, IP, RP, TREAT, NCF) 

C THIS USES RHCER 10 FIND THE LEAST SQUARES FIT UNLESS 
C USER SPECIFIES THE HYPOTHESIZED LINE FOR THE REGRESSION 
C BETWEEN AN INCEP (EXPLANATORY) VARIABLE AND A DEPENDENT 
C VARIABLE. IT TESTS FIT USING THE SPEARMANS RFC TC FIND 
C RANK CORRELATICN BETWEEN INDEPENDENT VARIABLE AND THE 
C RESIDUALS. IF USER REQ* D IT, IT WILL FIND CONFIDENCE 
C BOUNDS ON THE SLOPE (Bl) OF THE FITTED LINE. 

C IP(1):IND£X OF DEPENDENT VARIABLE IN LIST (1 OR 2) 

C IP(2): 0=USE LEAST SQUARES EST 81, 

C 1=HYP Bl; SUPPLIED IN RP ( 1 ) 

C IPO): 0=N0 EST'S OF Y BY X,ELSE = # OF Y'S EST,RP(3 + ) 

C IP(4); 0=NO CCNFICENCE INTERVALS, 1=Y£S 
C RP(2): VALUE CF W TO COMPUTE THE SLOPE INDICES 

INTEGER LFTR( 17) , IP( 50) , NOF , NPA I R , OPT ION , TREAT( 1 5 ) , 

* NPM1,KEY(20C) ,NR,NS,DIP(4) ,NE S TY , I X ,L , K ,i 17 ,1 47 
REAL LIST (400 ),RP (50 ) ,B1,B0,RH0, S( 20000) ,DENOM,AOEM, 

*>,Y, XSUM.YSUM ,EY ,UX( 400) ,QALFD2 ,B1R 
WRIT E(NOF, 201 ) 
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iOi 



678 



9 87 
20 

102 



21 

121 

122 

22 

24 



15/ lx, 
IS: ‘ /IX 



FORM f ‘^‘=fi>iOiN-PA RAMETRIC REGRESSION** • ) 

EPSI LN=0 .CGOl 
CPTICN=IF<i; 

CIP( 1 )=- 1 
XBAR=0 
YBAR=0 
IX=5-0PT ION 
IY=0PTIGN 
NPAI R=LP TR I 2) -1 

CALL RH0ER(LlST,NPAIR,RH0,8i ,B0, OPTION) 
l^RITE (NCFtlOl) TREAT(IY) , TREAT ( IX) , 81,80 
F0RMAT(1X, ^ , 

*» WITH DEPENDENT VARIABLE AS VARIABLE #',I5/1X, 

*• AND EXPLANAIORY/INOEPENOENT VARIABLE AS ' 

*• THE LEAST SCUARES FIT OF LINEAR REGRESSION 
*• DEP VAR = • ,F9.2,' * INOEP VAR + •,Fi0.2) 

B1R=RP( 1 ) 

IF ( IP(2 ).LT.l) aiK=ei 
IXL=LPTR ( IX)-1 
IYL=LPTR (I Y)“ 1 
WRITE(9,673) ]XL,IYL 
FORMATd/,* X & Y IN D IC £S : • , 215 ) 

CO 20 1 = 1 ,wPa IR 
J=NPAli5 + I 
X = LISTUXL*I) 

UX(J ) =X 

Y = LIST(IYL-*I) -81R*X 
UX( I ) =Y 

WRITE(9,987) LX{I)tUX<J) 

FORMATt I X ,*X, RESID: * ,2F10.4) 

CONT INUE 

WRIT£(NOF ,102 ) BIR 

F0RMAT( IX, 'TO TEST CORRELATION BETWEEN THE INCEPEN' 
*»DENT VARIABLc'/lX,* AND THE RESIDUALS OF THE FIT,'/ 
*1X,' WHERE ai=',FlC.4) 

CALL CORREL(UX,LPTR,NTREAT, TREAT ,NOF ,DIP ,RP) 

NESTY = IP O ) 

IFiNESTY. EC.O ) GO TO 24 
CO 21 I = ltNPAIR 

XSUM = XSLM + LIST ( IX L + I ) 

YSUM=YSLM+LIST{IYL*I) 

CONT INUE 

IFiIP(2).EQ«l) BO=(Y SUM-6i*XSUM) / FLO AT ( N P AI R ) 

WRIT £(NGF, 121 ) NESTY,61 

FORMATdX, 'ESTIMATE OF', 13,' DEPENDENT VARIABLES BY 

* 'REGRESSION,' /IX , 'WI TH Bl = * ,FiO. 4) 

CO 22 I=1,NESTY 

£Y=B1*PP( ( 2+1 ) )+B0 
WRITEiNCF, 122) RP((2+I)),EY 
FORMATdX,' INCEP VAR VAL UE: ' , FIO .4, 

* ' , DEP VAR ESTIMATE:* , FIO. 4) 

CONT INUE 

IF(I P(4) . EC.O ) GO TO 95 
I S=0 

QALF02=1.0-( ( 1.0-fiP(2))*0.5) 

N=NPA IR 
I FIN. GT. 60 ) 

* W=QALFD2*( ( FL0AT(N*IN-1)*( 2*N + 5) )/ 18. 0) =»*0. 5) 
I FIN, LE.60 ) W=RPI 2) 

NPM1=NPAIR-1 
DO 30 1 = 1, i'PMl 
LL=I+1 

CO 35 J=LL,NPAIR 

CENOM=L 1ST dXL+I)“L 1ST! IXL + J ) 
AD£M=ABStO£NOM) 

IF! ADEM.LT.EPSILN) GO TO 35 
IS=I S + 1 
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3(IS ) = (LIST( IYL+I)-LIST( lYL + jn/OaNOM 
KcY( 15) =IS 

3 5 CCiNTUUE 

30 CCNTINLE 

CALL SHSORKS tKSY ,15) 

I 17=17 
I 47=47 

wRITEl5,Si6) 5(1 17) ,5(147), I 5 
876 FCRMATdX ,*5LCPE5, /#5L0P £5* , 2F9 . 4 , 1 5 ) 

NR=IFIXO. 5*( IS-W) ) 

N5=IFIX(0.£«( IS+W )+1.96) 

WRIT£(NCF, 103) W , NR , 5 ( NR ) , N5 , 5 (N5 ) 

103 FORMATdX,' 8A5E0 CN A VALUE GF W = • , F 1 C. 4/ IX , 

**INDICE5 CF CCNFICENCE INTERVAL, fl VALUES OF SLOPE:*/ 

♦ IX, ‘LOWES 8CUN0 IN C EX : * , 1 3 , * WITH V ALU E : * , F 10. 4/lX , 

♦ * UPPER aOUND index:*, 13,* WITH VA LLE : ' ,F 10 -4 ) 

95 RETURN 

END 

c- 

SUBROUTINE MQNORE(LI ST,L?TR, IP, R P , TR EAT, NCF ) 

C THIS SUBROUTINE FERFCRMS MCNCTONE REGRESSION BASED ON THE 
C LEAST SQUAREC FIT OF THE RANKS OF 2 VARIABLES, IF THE 
C USER DESIRES, ESTIMATES CF SPECIFIC VALUES OF DEPENDENT 
C VARIABLE Y BASED CN VALUES OF X CONTAINED IN R? VECTOR. 

C ALSO, IF THE USER DESIRES, A TRACE OF THE REGRESSION wILL 
C BE LISTED. 

C IP(1): # OF CcPENCENT VARIABLE ( 1 OR 2) 

C IP(2): # CF SFcCIflED X VALUES FOR EXTIMATION OF Y VALUES 
C IP(3): DO YOU viANl A TRACE? 1=YES 0=N0 

REAL LIST (400 ),X( 200 ),Y(200) ,RX( 2C0) ,RY( 2C0) , TERM2, 

« B2,A2,FN,RP( 50) , 5XX ,SXY 
INTEGER KEYX( 100 ) ,KE YY(2C0) ,NPAI R,IX ,IY, IF( 50 ), 
«LPTR(17) ,T?EAT(17 ) 

NPAIR=LPTR(2) -1 
FN=FL0AT(NPAIR) 

IX=( 2-IP(l))#NPAIR 
IY = ( IP( i )-l) «NPAIR 
WRITE(9,21C) IX, lY 

210 FORMATdX,* INTO MON 0 RE ; I NO EX OF FIRST X , Y: ' , 21 5 ) 

CO 10 I = 1,NPAIR 
X ( I) =LI3T( I + IX ) 

Y( I) = LIST( I + IY) 

K EYX ( I )=I 
KEYY( I )=I 
10 CONTINUE 

CALL RNG(X,K5YX,NPAIR,RX) 

CALL RNQ( Y,K£W,NPAIR,RY) 

SXY=0.0 

SXX=0.0 

CO 15 I=1,NPAIR 
X ( I) = LIST( I-HX ) 

Y ( I) =LIST( I + IY ) 

15 CONTINUE 

CO 20 I=1,NPAIR 

SXY=SXY + RX (I I *RY( I ) 

SXX=SXX+RX (I l*RX( I ) 

2 0 CONTINUE 

TERM2=FN«( (FN-H.O )**2)/4.0 
B2 = ( SXY-TERM2 )/( SXX-TERM2) 

A2 = ( 1.0-62 )*< FN-H.O) / 2.0 

WRITE(NQF , aOI ) TREAT ( IP(D) , TREAT ( ( 3- IP ( 1 ) ) ) , 82 , A2 
101 FORMATdCX,**^MONOTON£ REGR ESSIO N** * / IX, * USING *, 

=4'*L£AST SCLARES REGRESSION ON THE RANKS OF VARIABLES*/ 

♦ lX,'rtITH VARIABLE #* , 13 , * AS DEPENDENT VARIABLE Y*/ 
*1X,’AND VARIAELE it* , 13, • AS INDEPENDENT VARIABLE X’/ 
*1X,» R(Y) = * ,F9.3,* * R(X) +*,F10.3) 

NEST = IP ( 2 ) 
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N=NP ^ IR 

IF ( KEST.GT.O » 

^ CALL ESTY( A2, B2,X , Y, KEYX, K£YY ,RX,RY,N,N£ST ,RP ,NOF ) 
IF ( IPO ) .tC.l J 

* CALL TFAC£(A2fE2,X,Y,RX,RY,K£YX,K£YY,N,N0F) 

RETURN 

END 

SUBROUTINE ESTY 

* ( A2 f 62 iX ,Yf KEYX , KEY Y ,RX,RY, N.NEST, RP,NOF ) 

C THIS FINDS THE ESTIMATES OF THE DEPENDENT VAR Y 
C FOR VALUES OF THE INDEPENDENT VAR X 

REAL A2 ,82,X( 200) ,Y( 200) , RX ( 200 ) , RY ( 200 ) , RP ( 5C ) 
INTEGER KEYXI 200) tKEYY( 200) ,K£Y( 200) ,N£S T ,N , X START , 

* YSTART 

IF ( NEST. EC. 0 ) GO TO 95 
XSTART=0 
CO 5 I»1 ,NEST 
KEY! I ) = I 
5 CONTINUE 

CALL SHSCRTIRF, KEY, NEST) 

WRITE (9,211) RP ( 1) , K£Y( 1) ,NEST 
211 FORMAT! IX, • INTO ESTY: RP ( 1 ) , KEY { 1 ) , NE ST: • ,F 10 . 3 , 21 5 ) 
DO 10 1=1, NEST 
V ALX=RP( I ) 

CALL VTCR (X ,RX , KEY X ,VALX, RNKX, N,XSTART ) 

IF ( RNKX.GE. 1) GO TC 15 

WRITEiNOF ,101 ) I , VALX.X( KEYX (1)),X(KEYX(N)) 
FORMAT( IX ,‘EST .i)*,I3,* NCT POSSIBLE; X=',F10.4/ 
9X,‘ FALLS OUTSIDE RANGE 0F*,F10.4,' TO* ,F10.4) 
GC TO 10 
RNKY=A2+62^RNKX 
YSTART=C 

CALL RTCV (Y,RY,KEYY,VALY,RNKY,N, YSTART) 

WRITEiNOF, 102) I,VALX,VALY 
FORMATdX, 'EST #• , 13, 

': X = ‘,F1Q.4,‘ => E(Y|X) 0FSF1C.4) 

10 CONTINUE 
95 RETURN 

END ' __ __ 

SUBROUTINE VT CR( Z ,RZ , KEYZ , VA LZ , R NKZ , N ,2START ) 

C HAS A VALUE, FINOS ITS RANK 

REAL Z( 20C),RZ(20C), RNKZ,VALZ 
INTEGER KEYZ( 200 ) ,N, ZSTART 
RNKZ=-1. C 

IF(( VALZ.LT.Z (KEYZd ) ) ) .OR. ( VALZ .GT. Z ( KE YZ( N ) )) ) 

* GO TC 95 

CALL INTERF(Z ,RZ , KEY Z , VALZ, RNKZ , N , ZSTART ) 

GO TC 97 

95 WRITE(9,lCi) VALZ , Z( K EY Z (1 ) ) , Z( K EYZ ( N ) ) 

101 FORM AT( IX, 'GUT OF RANGE : VAL , LL, UL :•, 3F10 . ^) 

97 RETURN 
END 

SUBROUTINE RTCV( Z ,RZ , KEYZ ,VALZ, RNKZ, N, ZSTART ) 

C HAS A RANK, FINOS ITS VALUE 

PEAL Z(2CC) ,R2(200 ), RNKZ, VALZ 
INTEGER KEYZ< 200 ) ,N, ZSTART 

IF(( RNKZ.GT.R2(KEYZ( 1 ))) .AND .(RNKZ. LT.RZ ( KEYZ (N) )) ) 

=4= GO TC 50 

I FiRNKZ.LE.RZ ( KEYZd) ) ) V ALZ= Z ( KE Y 2 (1 ) ) 

I FIRNKZ.GE .RZ(KEYZ(N) ) ) V ALZ = Z ( KE YZ ( N ) ) 

GC TO 95 

50 CALL INTEFP(R2,Z , KEYZ, RNKZ, VALZ, N, ZSTART) 

95 RETURN 
END 
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SUBROUTINE IN7ERPIA, 6 ,KEY i A VAL , B V AL ♦ N , I ST ART } 

THIS SUBROUTINE COES LINEAR INTERPOLATION: KEY IS THE 
ORDER OF VALS A AND B* AVAL IS KNOwN, FIND BVAL. ISTART 
IS THE INDEX OF KEY AT WHICH TO START LOOKING: WITH PRIOR 
KNOWLEGEO, SUCH AS IF AVAL'S WERE SORTED, ISTART NAY BE 
GREATER THAN C 

REAL A( 2CU ,B ( 20 0 ) ,A VAL , BVAL , PROP , 8PRCP 
INTEGER I,ISTART,N,KL,KU,KEYI200 i 
1 = I START 
CONT INUE 
I =I-*-l 
KU=KEYm 

IFU AVAL.GT.AIKU) ).AND.( I .LT.N)J GO TO 10 
KL=KEYC t I-i) ) 

PROP = (AV AL-A( KLi ) / (A (KU »-A(KLn 

BPROP=PRCf=^(B (KU )-B( KU i 

eVAL=BI KU+BPFOP 

ISTART=I-1 

RETURN 

END 

SUBROUTINE TR ACE ( A2 , 82, X , Y, RX ,R Y , KEYX ,KE YY, N , NOF j 
THIS SUBROUTINE FINDS THE TRACE OF THE REGRESICN 
IN GENERAL BY TAKING THE ACTUAL RANK OF Y, FINDING 
THE EXPECTED RANK OF X ANC THE CORRESPONDING VALUE OF X, 
AND THEN PLOTTING THAT X WITH THE ACTUAL VALUE OF Y. 

REAL EX ( 2CCI , EY( 200) ,X( 200) , Y ( 20 0 ) , RX ( 200 J , R Y (200 ) , 
♦RYB, A2,B2,BYS 

INTEGER KEYX( 200 } ,KE YY( 200) , N,NO F 
K=0 

WRITEINOF , 101 ) 

FORMATdOX,' TRACE OF THE MONOTONE REGRESSION '/IX, 
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RYS=A2+e2*(RX(KEYX(l ) )) 

IF(RYX.LT.RY(KEYY(1) ) ) GO TO 5 
K = K+1 

£X(K )=>(KEYX( 1 ) ) 

YSTART=3 

CALL RTCV (Y,RY,KEYY,YY,RYS,N,YSTART) 

EY(K ) =YY 

WRITEiNCF , 102 ) EX(K),EY(K) 

5 CO 10 1 = 1, N 
J = KEYY (I ) 

ERX=(R\(J)-A2)/B2 
XSTART = 0 

CALL RTCV (X,RX,KEYX ,XX,ERX,N,XSTART) 

K = K+ 1 
EX<K)=XX 
E Y(K ) =Y { J) 

WRITE(NCF, 102) EX(K),EY(K) 

10 CONTINUE 

RYB=A2+B2=»(RX(KEYX(N) )) 

IF (RYB.GT,RY( KEYY(N) ) ) GO TO 20 
K = K+1 

EX(K)=XK£'lX(N)) 

YSTART=0 

CALL RTCV (Y,RY,KEYY , YY,RYB,N ,YSTART) 

E Y(K) =YY 

WRITE(NCF, 102 ) EX(K),EY(K) 

102 FORMAT! IX ,2F1C.4) 

20 CALL PLOTKEX ,EY,N,1 ) 

RETURN 

END 

C- 

SUBROUTINE KCLMOG (L 1ST ,LPTR ,TR£ AT, I P ,RP , NOF ) 

C THIS SUBROUTINE FINDS THE TEST STATISTIC FOR THE 
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C KOLMCGCRCV ST^iTJSliC FOR SAMPLES OF UNDER 31 ELEMENTS. 

C DEPENDING ON NULL HYP, THE TEST STATISTIC IS THE LARGEST , 
C SMALLEST OR GREATEST ABSOLUTE DIFFERENCE F(X)-S(X), 

C WHERE F IS THE HYPOTHESISED THEORETICAL CIST AND 
C S IS THE EMPIRICAL CDF. IP ( 1 J = CHOICE OF OISTR; 

C IP(2) IS THE CHOICE OF HYPOTHESIS 

C SEE THE DISTRIBUTION SUBROUTINES FOR MEANINGS OF THE 
C OTHER PARAMETERS RP AND IP. 

REAL LIST(400),S(400 ),F(400) ,RP(50) ,FN,TSTAT 
INTEGER LPTRI 17) , IP( SO) ,NCF , IDI ST ,H YP ,KE Y (400 ) 
WRITE(N0F,101 ) 

101 FORMAT( 1C KOLMOGOROV TEST ***** I 
FYP= IP(2 ) 

N=LPTR(2)-1 
FN=F LCAT (N) 

IDIST = IP (1 ) 

DO 10 I =1 ,N 

S(I)=FLCAT(I)/FN 
KEY( I ) = I 
10 CONTINUE 

CALL SHSORTIL 1ST ,KEY ,N) 

IF(I DIST.eg.l ) CALL NCRMF(LIST,N ,RP, F,NOF ) 

IFU DIST. EC. 2 ) CALL EXPF ( LI ST,N , RP , F ,NOF ) 

IF(I GIST .EG.3 ) CALL UN I F ( LI ST ,N , RP , F ,NOF ) 

IF(I DIST.EG.4 ) CALL ERLF ( LIS T ,N , RP , I P ,F, N CF ) 

IFU CIST. EG. 5 ) CALL WEI 8F ( L I ST, N ,RP , F ,N0 F ) 

CALL FINCT(S , F,N ,TSU P ,TMIN, TMAX , NOF ) 

IF(HYP.EG.i) CALL TH YPA ( TSUP ,NOF ) 

IF(HYP.EC.2) CALL TH YPB ( TMAX , NOF ) 

IF(HYP.EC.3) CALL THYPC(TMIN, NOF ) 

RETURN 

END_ __ __ _ __ 

SUBROUTINE ERLF( L 1ST , N, RP , I P ,F , N OF ) 

C THIS FINCS THE THEORETICAL ERALNG DISTRIBUTION F, WHERE 
C NN=I P( 1) =SHAPE PARAMETER, AND L=RP( 1 ) =SCALE PARAMETER 
REAL LX,LXN,L,X, LIST(400) ,RP ( 50 ) , F ( 400 ) , ON 
INTEGER IP(50 ) ,I ,N,FACT ,NNM1 
NN=I P(l) 

NNM1=NN-1 
CO 30 1=1, N 
FACT=1 
X=LIST ( I ) 

LX=L«X 

LXN=0 



CO 23 J=1,NFM1 
FACT=FACT* J 
LXN = LXN-KLX^*J )/FACT 
23 CONTINUE 

IF (X.LE.O) F(I)=0 

IF (X.GT.O) F( I )=1 .0-(1.0 + LXN)*EXP(-LX) 

30 CONTINUE 

WRIT6(NOF,101) NN,L 

101 FORM AT(1X , 'WITH HYPOTHESISED DI S TRI BUTUI ON ERLANG'/IX 
♦’WITH PARAMETERS N=* ,I5,» AND LAM80A= • , F 10.4 ) 

RETURN 

END 

SUBROUTINE WE I8F ( L IS T ,N , RP, F ,NOF ) 

C THIS FINOS THE THEORETICAL WEIBULL OISTIBUTION F, WHERE 
C A=RP( 1) =SHAPE PARAMETER, AND L=RP( 2 ) =SCALE PARAMETER 
C LAMEDA 

REALgA^L j>,Ll ST(400) ,RP(50) ,F(400) 

A = RP (1) 

L=RP (2) 

CO 10 1 = 1, N 
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X=LIST(I ) 

I F(X. LE.O) F( I )=0 

I F(X. GT.O» F( I .0-EXP(-{ IL*X)=<‘=<=A) » 

10 CONTINUE 

WRIT EINOF, 101 ) A,L 

101 FORMAT! IX, • WITH HYPOTHESISED DISTRIBUTION wEIEULL'/iX, 
«'WITH PARAMETERS ALP hA= • , F10.4, ' AND LAMBDA= • ,F1 0.41 
RETURN 

END _ __ — 

SUBROUTINE UN IF( L 1ST , N, R P , F , NOF ) 

C THIS SUBROUTINE FINDS THE THEORETICAL UNIFORM DISTRIBUTION 
C F, WHERE A=RP (1)=L0WER BOUND, B= RP( 2 )=UP P ER BOUND 
REAL C,A ,B,X,LIST(40 0),RP{50),F(400) 

INTEGER I 
A = RP (IJ 
e=RP (2) 

C=6-A 

CO 10 1 = 1, N 
X=LIST(I» 

I F (X.LE.A ) F{ I»=0 
I F (X.GE.B ) F( I)=l 

I F(( X.GT. A ).ANC. ( X. LT.B) » F( I ) = (X-A)/C 
10 CONTINUE 

WRITE(NOF ,101 ) A,e 

101 FORM AT( IX, 'WITH HYPOTHESISED DI S TRI BUTUI ON UNIFORM', 
*/lX, 'WITH PARAMETERS LOWER L IMIT= ‘ , F10.4, 

* • AND UPPER LIMI T=* ,F10.4) 

RETURN 

END 

C-/-/ — — 
SUBROUTINE LIlIEF (L 1ST , LPTR , TRE AT , I P ,RP , NOF ) 

C THIS SUBRCUTINE FINDS THE TEST STATISTIC FOR THE 

C LILLIEFORS TEST FCR SAMPLES OF UNDER 31 ELEMENTS. 

C DEPENDING ON HYPOTHESIS; TEST STATISTIC IS THE LARGEST 
C SMALLEST OR GREATEST ABSOLUTE DIFFERENCE F(XI-S(X), 

C WHERE F IS THE HYPOTHESISED THEORETICAL DISTRIBUTION AND 
C S IS THE EMPIRICAL CDF. IP(l) IS THE CHOICE OF 
C DISTRIBUTIONS; IP (2) IS CHOICE OF HYPOTHESIS. PARAMETERS 
C ARE ESTIMATED: RP(1)=SAMPL5 MEAN ,RP ( 2 J=VARIANC£ 

C SEE THE DISTRIBUTION SUBfiCUTINcS FOR THE MEANINGS OF THE 
C OTHER PARAMETERS FP AND IP. 

REAL LIST(400 ),S <400 » , F ( 400 ) ,RP < 50 > , FN, TSTAT , MU , 

^TMIN ,TMAX, VAR ,TSUP 

INTEGER LPTR< 17) , IP( 50) ,NOF, IDIST,HYP , KEY (400) 

HYP=IP(2 ) 

N=LPTR( 2)"1 
FN=F LCAT (N ) 

CALL MUVAFILI ST,N,MU ,VAR) 

WRIT E(NO F , 101 ) MU , VAR 

101 FO RM AT (1 OX, LILLIEFORS TEST ***=**/lX, 

*'WITH COMPUTED SAMPLE MEAN=* ,F10 .4, 

* ' , AND VAR=* ,F10.4) 

RP (1 ) =MU 

RP (2 )=VAR 
IDIST=IP (1 ) 

IF(I 0IST.EC.2 ) RP(l)=i.O/MU 
DC 10 1 = 1, N 

S (I) = FLCAT (D/FN 
KEY! I ) = I 
10 CONTINUE 

CALL SHSCPT(LIST ,KEY ,N) 

IFdOIST.EQ. 1) CALL NCRMF (L I ST, N ,RP , F ,NOF ) 

IF(I CIST .5G.2 I CALL EXP F ( LI S T,N , RP , F ,NOF ) 

CALL FINCT(S,P,N,TSUP,TMIN,TMAX ,NOF) 

IF(HYP.EG.i) CALL TH YPA ( TSUP ,NOF ) 

IF(HYP.EC.2) call THYP8{TMAX,N0F ) 
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IF(riYP,£C.3) CALL TH YPC ( TMI N , NQF ) 

RETURN 

END __ __ 

SUBROUTINi WILKSH (L I ST , LPTR , TRE AT , I P ,RP , NOF J 
C THIS SUBROUTINE FINOS THE TEST STATISTIC FCR THE 
C SHAPIRO-WILK TEST FOR NORHALITY, N < 31 ELEMENTS. 

C IP(1)= NUMBER CF COMPARI SQNS= TRUNCI# DATA ELEMS/2) 

C RPm t.. . tRP(K)= WEIGHTS TO SCALE COMPARISONS 
C THE CENOMINATCR IS THE SUM OF SQUARED OEVATICNS CF X;AND 
C HERE IT IS FOUND EY SXX= ( N- 1 ) *VAR ( X ) 

C THE NUMERATOR IS SQAURE CF SUMS OF COMPARISONS OF VALUES 
C OF EQUAL DEPTH FRCM END POINTS, E.G., WITH N CATA ELMS, 

C X(N )-X(l ) ,X( N-D-XZ ) ; A(I) IS MULTIPLIED TIMES THE 
C DIFFERENCE 

REAL LIST(400),S( AOO ) ,F(4001 ,RP( 50) , FN , T STAT , MU , VAR , 

«NUM, DENOM,MU, VAR, TSUP,SUMWT 
INTEGER LFTR( 17) , IP{ 50) ,NGF, IDIST ,HYP ,K, KEY! 4C0 J 
N=LPTR( 2)-i 

CALL MUVAF(LIST, N,MU , VAR) 

DEN0M=FLCA7(N-1) *VAR 
CO 5 I = 1 , N 
KEY( I ) = I 
5 CONTINUE 

CALL SHSOPTI L 1ST ,KEY , N) 

K=IP (1) 

SUMWT=0. C 
CO 10 1 = 1, K 

TEMP = RF(1 J ♦(LI ST( N-I+1)-LIST( D) 

WRITE( «,21C) RP(1 ) ,LI SKN-I + l ) ,LIST( I ) ,TEMf 
210 FCRMATdX, »RP ,LISThI ,LISTLO,TEMP ',4F1C.4) 

SUMWT=SLMWT+TEMP 
10 CONTINUE 

NUM= SUMWT*SUMWT 
T3=NUM/0ENCM 

WRIT£(NQF,101 ) N , MU, VAR , T3, NUM, DENOM 
101 F0RMAT(1CX,'4=**# SHAPIRC-WILK TEST FOR NORMALITY 

♦/IX, ‘WITH A SAMPLE SIZE OF' , 1 5/1 X , • W I TH COMPUTED >, 

♦ •SAMPLE ME4N=*,F10.4, •, AND VAR= ' , FIO .4/ IX, . 

♦ 'THE TEST STA7ISTIC=* ,F10.4/1X, 

♦'FROM NUMERATOR' , FIO .4, • , AND DE NOMI NATOR=* , F 10. 5 ) 
RETURN 

END _ 

SUBROUTINE SMIRN2 ( L I ST , LPTR , TRE AT , I P ,RP , NOF ) 

C THE SUBROUTINE TAKES ELEMENTS OF TWO V AR I A BLE S , SORTS 
C THEM, ANC C0MF4IRS THEIR EMPIRICAL CDF'S (FOUND BY 
C DIVIDING THE RANK MINUS BY THE TOTAL NUMBER CF ELEMENTS) 

C I,M REFER TO FIRST VAR X; J,N REFER TO SECOND VAR Y 
C K IS COUNTER FCR CIFFERENCE VECTOR S, L= 1 MEANS LAST 
C PASS IP(1) IS THE INDEX OF HYPOTHESIS TYPE: 1=H0: NO OIF 
C 2=H0:F(X )<=G( V), 2=H0:F( X)=>G(Y) 

REAL EPSILN, FN,FM,X (100) ,Y( 100) ,DIF,ADIF,S(2C0) ,TMIN, 

♦TMAX,ATMAX,ATMIN,TSUP,POIF,LIST(400) ,RP(50) ,TSTAT 
INTEGER I,J,K,L,M,N,KEYI(iOO),KEYJ( 100) , JJ, IP(50), 

♦ LPTR(17) ,NOF ,HYP,I I , TREATd 7) 

CALL FINCS(LIST,LPTR ,S,K) 

TMIN=2.0 

TMAX=-2. C 
HYP= IP( 1 ) 

CO 35 1 = 1, K 

IF ( S( 1 ).GT.TMAX) TMAX=S(I) 

IF { S( D.LT.TMIN) TMIN=S(I) 

35 CONTINUE 

ATMIN=ABS(TMIN) 

ATMAX=ABS (TMA> ) 

TSUP=ATMIN 
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IF ( ATMAX.GT. 7SUP ) TSUP=ATMAX 

V^RITE (NGF,102) TREA T (1 J ♦ TRE AT( 2 ) , TM I N ,TM AX ♦ T SUP 

102 FORM ,-aT<l OX, •♦**** KOLMOLGOROV TEST 

*• WITH DATA SET VAR #*,I3,' AS VARIABLE X */6X, 

*• AND VAR ;**,I3f* AS VARIABLE Y'/IX, 

*• THE MAX DIFFERENCE F( X )<G( Y ): • , FIO .A/IX , 

THE MAX DIFFERENCE F( X ) >G ( Y ) : • ,F1 0. 4/1 X , 

THE OVERALL GREATEST OIFFERENC E: * , FIO. 4 J 
IF(HYP.EC.l) TSTAT=TSUP 
IF(HYP.EQ,2J TSTAT=TMAX 
IF(HYH.EC.3) TSTAT=TMIiN 
WRITE(NGF,i03 ) TSTAT 

103 FORM AT( IX/IX, 'TEST S TATI STI C ; • , F 10. 4 ) 

RETURN 

END __ __ 

SU8R0UTI NE CRVONM (LI ST, LPTR, TREAT, IP , RP, NCF J 
C THIS SUBFCUTINE FINDS THE TEST STATISTIC T2 CF THE 

C CRANER-VON MISES TEST, AND DOES LINEAR INTERPOLATION 
C TO ARRIVE AT THE ALPHA HAT, ALPHAT. 

REAL EPSILN,FN,FM,S (400) ,SSUM,LSIG, ALPHAT, ALFHYP, 

♦ T2 ,ATMAX,ATHN,TSUP,PDIF,LIST(400) ,RP( 50) , CONST 
INTEGER J ,K,L ,M, N , K5 Y I ( 100) , KEY J (100) , 1 1 , JJ , I F ( 50 ) , 

♦ LPTRdT) ,N0F,I,TR£AT(17) 

CALL FINCS(LIST,LPTR ,S,K) 

M=LPTR( 2)-l 

N = LPTR( 3 )-LPTF(2 ) 

CONS T=FL CAT (M^N) /FLOAT! (M+N) »(M+N) ) 

ALPHYP=1.G-RP(1) 

SSUM=0.0 
DO 3 5 I = 1,K 

SSUM = SSUM+(S( I )«S( I) ) 

35 CONTINUE 

T2=CGNST4>SSUM 

WRITE(9,121) ^,N,K, CONST, SSUM,T2 
121 F0R^^AT(1X,••M, N,K, CONST, SSUM,T2:* ,5I4,3F10.4) 

CALL CRVALF( T2, ALPHAT ) 

WRITE (NCF,102» T2 .ALPHAT ,ALPHYP 
102 FORMAT! 1 OX,' CRAMER-VON MISES TEST 
*• THE TEST STATISTIC T2 : • ,F 1 0.4/ IX, 

♦ • GIVES THE ALPHA HAT OF : • , F 10. 4 /iX , 

COMPARED TO THE HYPOTHESISED A LPHA : • , F 1 0. 4/ IX , 

♦ • NULL HYP HO: F{X)=G(X) FOR ALL X',/1X, 

4* ALT HYP Hi: F(XK>G(X) FOR AT LEAST ONE X*) 

IF { ALPHYP.LE .ALPHAT ) WRITE (NOF.lll) 

IF ( ALPHYP.GT .ALPHAT ) WRITE (N0F,112) 

111 FORMATdX,' DC NOT REJECT THE NULL HYPOTHESIS') 

112 FORMATdX,' REJECT THE NULL HYPOTHESIS') 

RETURN 

END 

SUBROUTINE CRVALF(T2, ALPHAT) 

REAL W( 14) ,ALF( 14) .ALPHAT, PART, T2.DE ,NU, LSIG 

Wd) =0.0 

M2) =0. 04 

W( 3) =0. 062 

W(4) =0. 079 

W( 5) =0. 097 

W(6) =0.119 

M7) =0.147 

W{ 8) =0. 134 

W(9) =0.241 

M10)=0. 347 

Wdl )=0. 461 

W(12 ) =0. 74 3 

W( 13 )=1. 166 

W(14 )=99 
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ALF( 1) = 0.C 
ALF( 2) = 0 .1 
ALF( 3)=0,2 
ALF( 41 = 0 .2 
ALF( 5) = 0 .4 
ALF( 6) = 0. 5 
ALF( 7) = 0.6 
ALF( 8)=0.7 
ALF( 9) = 0.£ 

ALF{ 10)=0.9 
ALF( 11) = C. 95 
ALF( 12) = C.99 

13) =0.9999 

14) = 1.C 



ALF( 
ALF( 
1=0 
1 = 1 + 
IF ( 



(T2. GT.rt( D) .AN0.{I.LE.13)) GO TO 10 
IMi= I-i 
NU=T2-W( INI) 

CE=W (D-WdMl ) 

FART=NU/ CE 

LSIG = ALF (INI) +PART*( ALF ( I )-A LF( I Nl) ) 
ALPHAT=1 .C-LS IG 
URITE (9,131) I, PART 
' W(IMl) ,k(I) ,NU,0E 
131 FGRMAT(1)(, • I, FART.T2 
■ ' ,NU ,0E* 



T2,ALF( IMl) ,ALF(I),ALPHAT, 



,ALF-ALF,ALPHAT' 

,4F10.4) 



, 13, 5F10.4/1X, 

♦ • W( IMl ) ,W( I ) 

RETURN 
END 

SUBROUTINE UTEST 



PURPOSE 

TEST WHETHER TWO INDEPENDENT GROUPS AFE FROM SAME 
POPULATION BY MEANS OF MANN-WHITNEY U-TEST 

USAGE 

CALL UTEST(A,R ,N1,N2,U,Z, lER) 

DESCRIPTION OF PARAMETERS 

A - INPUT VECTOR OF CASES CCNSISTING OF TWO INDEPENDENT 
GROUPS. SMALLER GROUP PRECEDES LARGER GRCUP. LENGTH 
IS N1+N2. 

R - OUTPUT VECTOR OF RANKS. SMALLEST VALUE IS RANKED 1. 

LARGEST IS RANKED N. TIES ARE ASSIGNED AVERAGE OF TIE 
RANKS. LENGTH IS N1 +N2 . 

N1 - NUMBER CF CASES IN SMALLER GROUP. 

N2 - NUMBER OF CASES IN LARGER GROUP. 

L - STATISTIC USED TO TEST HOMOGENEITY OF THE TWO 

GROUPS (OUTPUT). 

Z - MEASURE CF SIGNIFIGANCE OF U IN TERMS OF NORMAL 
DISTRIBUTION (OUTPUT). 
lER- 0, IF NC ERROR. 

- 1, IF ALL VALUES OF ONE GROUP ARE TIED. 

REMARKS 

Z IS SET TO ZERO IF N2 IS LESS THAN 20 

SUBROUTINES AND FUNCTION SUBPROGRAMS REQUIRED 
RANK 



METHOD 

DESCRIBED IN S. SIEGEL, 'NONPARAMETRIC STATISTICS FOR THE 
BEHAVIORAL SC lENCES * , M CGRAW-HI LL, NEW YORK, 1956, 

CHAPTER 6 
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SUBROUTINE UT EST ( A ,R , M , N2,U ,Z , I ERi 
DIMENSION A( 1 ),R ( 1) 

RANK SCORES FPCM EOTH GROUP TOGETHER IN ASCENDING ORDER, A 
ASSIGN TIED OBSERVATIONS AVERAGE OF TIED RANKS 

N=N1+N2 

CALL RANK(A,R,N) 

Z=0. 0 

SUM RANKS IN LARGER GROUP 

P2=0 .0 
NP=N1+1 
CO iO I=NP,N 
10 P2=R2+R( I) 

CALCULATE L 

FNX=N1*N2 

FN=N 

FN2=N2 

UP=FNX+FN2^( ( FN2 + 1.0 )/2.0J-R2 
U=FNX-UP 

If(UP-UJ 2C,3C,30 
20 U=UP 

TEST FCR N2 LESS THAN 20 
30 IFIN2-20) aO,AO,4C 

COMPUTE STANDARD DEVIATION 
40 KT=l 

CALL TIE (P ,N, HT,TS) 

IF(TS) 5C,fcC,fO 

50 IF ( TS-( FN. + FN^FN-FN) /12)52,51,52 

51 I5R=1 

GO TO 80 

52 S=SQRT( ( FNX/( fN# ( FN- 1 .0 )) )* ( ( ( FN «FN«FN-FN )/12.01-TSl ) 
GO TO 70 

60 S=SQRT( FNX=»(FN+1.0)/12.0) 

COMPUTE Z 

70 Z=(U-FNX«C.5) /S 
80 RETURN 
END 



C 

SUBROUTINE RANK(A,R,N> 

DIMENSION A( 1),R( 1) 

C I NITI ALIZATION 

DO 10 I=1,N 

10 R(I)=0.0 

C FIND RANK CF DATA 

DO 100 I=1,N 

C TEST WHETHER DATA POINT IS ALREADY RANKED 

IF(R (I) I ZCf 20, 100 
C DATA POINT TO BE RANKED 

20 SMALL=0.0 
EQUAL=0. C 
X=A( I ) 

DO 50 J = 1 ,N 

IF(A ( J»-X) 30 , 40, 50 

C COUNT NUMBER OF DATA POINTS WHICH ARE SMALLER 
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30 SMALL=SMALL + 1 .0 



C 



GO TC 50 

COUNT ^UMBER OF DATA POINTS WHICH ARE EQUAL 



40 EQUAL=EQUAL + 1 .0 



rs I “ JL • 

50 CONTINUE 

TEST FCR TIE 



R( Jl =-!• C 



C 



IF(EQUAL-l.C) 60t 60 t 70 

STORE RANK OF DATA PCINT WHERE NO TIE 

60 R( I) =SMALL + 1 ,C 
GO TC IOC 

CALCULATE RANK OF TIED DATA POINTS 
70 F=SMALL + (EQUAL + 1.0)«0,5 
CO 9C J=I tN 

IF(R(J) + 1.0) 90f 80, 90 

80 R( J) =P 
90 CONTINUE 
100 CONTINUE 
RETURN 
END 



SUBROUTINE TI E (R , N ,KT ,T ) 

DIMENSION R(1 J 
I NITI ALIZA7I0N 
T=0. 0 

y=o. 0 

5 > = 1. 0E38 
IND=0 

FIND NEXT LARGEST RANK 

DO 30 1=1 ,N 
IF(Rd)-Y) 30,30,10 
10 IF(Rd)-X) 20,30,30 
20 X=R( I ) 

IND= INO+1 
30 CONTINUE 

IF ALL RANKS HAVE BEEN TESTED, RETURN 
IFdNDJ 9C,90,40 
40 Y=X 
CT=0.0 

COUNT TIES 
DO 6 0 I =1 ,N 




C 



CALCULATE CORRECTION FACTOR 



75 _ 0 

GO TC 5 

80 T=T+ (CT^S'CT^CT-CT >/12.0 




GO TC 5 
90 RETURN 



END 
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RAM LOCHI (INPUT fOUTPUT ,QINf GOUT , A, B,C,Ot£fF, CPi ; 



APPENDIX G 



PASCAL SOURCE CODE: LOCHINVAR 
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